FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGC AGGT C T GAAGACTAACAT TT TGTGAAGT T GTAAAACAGAAAACCTGT TAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACAT TACT GCAGTAACACT CCACCATATAGAC CCGGC T T TACCTT AT ATCAG TGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCAT TTAT GT T CGT TATAAGCAAGT TCAT GCT CTGAGTCC TGAAGAGAACGT TATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TAT AT AT G T T T GT T CAGAC CAT CC T T T CC TAC C AAAT G CAGCC C AAAAT CC AT GGC AAAC AAG T C 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
AT CAGT TT TGCACAG T GGCAAT T T T GGGAC TGAT T TAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGT TATGTGCT TCACATGAT CACTACTGCAGCAGAAT GGTCT AT G T CAT TT TCCT T CT T T 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

T GGAT TAACCCTC TATGACAC TGCACC T TGCCC TAT TAACAATGAAC GAACACGGCTACT T T CCA 
GAGATAT T TGA TGAAAGGATAAAATAT TTCTGTAAT GAT TATGATT CTCAGGGAT TGGGGAAAGG 
T TC ACAGAAGT TGC T TAT TCT TC T C TGAAAT T T TCAAC CACT TAATCAAGGCT GACAG TAACAC T 
GAT GAATGCTGATAATCAGGAAACATGAAAGAAGCCAT TTGATAGATTAT TCTAAAGGATATCAT 
CAAGAAGACTAT TAAAAACACCTAT GCCTATACTTT TT TATC TCAGAAAAT AAAGTCAAAAGAC T 
ATG 



App_BD= 10063 579 



Page 153 of 320 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX{S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVR YKQVHALS PEENV 1 1 KLNKAGLVLG I LS CLGL S I VANFQKT T LFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVSANLHGLTLYDTAPCPINNERTR 

LLSRDI 



Important features: 



Type II transmembrane domain: 



amino acids 13-33 



Other Transmembrane domains: 



amino acids 54-73, 94-113, 160-180, 122-141 




N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105/ 124-130, 183-189 



AppJD-10063579 



Page 154 of 320 



FIGURE ft 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GAT TTGGGTGGCTTT T CTTCATGCGCCAATTGT T TAAAGAC TATGAGATACGTCAGTATGT TGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGT AT T GAATAGCAGC T CC CGTT AT T TT C ACT G GAAAAT GAACCT GT GT GT AAT T CT GCT GAT C CTG G 
TT T TCAT GG TG C CTT T T TAC ATT GGCT AT TT TAT T GT GAG CAAT AT C CGAC T ACT GCATAAAC AACGA 
* CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 

CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
f_ \ CT CAGGAATGT GACTGACACGGAT ATTC TAGC C CT GGAACGGCGACT GCTGCAAACCATGGATATGAT 

fj.? (^TAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGT T 

W i 

%\ AAC CAT CAGGT T T CT GGGGAATGATAAAAAGT GTTACCACTTCAGCAT CAGGAAGTGAAAATCT TACT 

<JJ CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTT CTGGAAACAGCTGATCT 

8 AT AT GCT AC CAAGGAGAGAAT AGAATAC T CCAAAACC T T CAAGGGGAAAT AT TTT AAT TTTCTTGGTT 

M ACT TTTTCTCTATTTACTGTGTTTGGAAAATTT TCAT GGCTACCATCAATATTGTTTTTGATCGAGTT 

GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 



\\\ 

£1 TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 



5 f * 



CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATAC CGCACCATAATCACTGAAGTCC TTGGAGAACT GCAGTT CAAC TT CTATCACCGTTGGT TTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GAT ATAAGAGG GGGGAAAAAT G GAACCAG GGC C TGACAT T T T ATAAACAAACAAAATGC T AT GGTAGC 
ATT TTTCACCTTCATAGCAT ACTCCTTCCCCGTCAGGT GATACTATGACCAT GAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
C TAT GGT AG CT GAGCCAAACACGT AGGAT TT C C GT T T T AAG GT TCAC AT GGAAAAGGTTATAGC T TT G 
CCT TGAGAT TGACTCAT TAAAAT CAGAGACTGT AACAAAAAAAA&AAAAAAAAAAAAGGGCGGCCGCG 
ACT CTAGAG TC GAC C T GCAGAAGC TTG GC CGC CAT G G C C CAACTT GT T TAT T GCAGCTT ATAAT G 
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FIGURE 4 



f't 
si? 



*• * V * 



MS FLIDS S IMI TSQ I L FFGFGWLFFMRQL FKDYE I RQYWQVI FS VT FAFSCTMFEL 1 1 FE I LGV 
LNS S SRYFHWKMNLCVI LL ILVFMVPFYI GYFI VSNI RLLHKQRLLFS CLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAWCPYTYMSYFLROTTDTDIIJ^ERRLLQ 
TMDMIISKKKRM&MARRTMFQKGEVHNKPSGI^^ 

LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 283-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE 5 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGT.TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCT TGTGAGCAAAAAGGCGAACCAGCAGCTGAAT TTGACAG 
AAGCTAAGGAGGCC T GTAGGC TGCT GGGAC TAAGT T TGGCCGGCAAGGACCAAG TTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT T ATCACCACCAAAGAT CCCAT AT T CAACACTCAAACTGCAACACAAACAACAGAAT T TAT T GT 
M ! CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
f\ C T GCTCCAGC T TCCACT TC TAT TCCACGGAGAAAAAAAT T GATT TGT G T CACAGAAGT T T T TAT G 

?;? { GAAACT AGCACCAT G T C TACAGAAACT GAACCAT T T GT T GAAAATAAAGCAGCAT TCAAGAAT GA 

hi AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
& CAGCTGGTCT TGGAT TT T GCTATGT CAAAAGG TATGT GAAGGCCT T CCCT TT TAGAAACAAGAAT 

.UJ CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
\V\ T GAGGAATCAAAGAAAAC TGATAAAAACCC AGAAGAG T CCAAGAGT CCAAGCAAAAC TAC CGT GC 

Zl GATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT TCTT T 

'** CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

CCCT TGGT TC CT AAC T GGAATCAGCTCAGGAC TGCCATT GGAC TATGGAG T GCACCAAAGAGAAT 
a GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
CJ TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
Tfl CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
%\ AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
WJ GCT CTGAAAGAGAAACACGTATCCCACCT GACATGT CCTT C T GAGCCCGGTAAGAGCAAAAGAAT 

UJ GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
£1 GCTAAAATAAAGAAATAGAACAAGGC T GAGGATACGACAGTACAC T GTCAGCAGGGAC TGTAAAC 

fj'| ACAGACAGGGTCAAAGTGT TT TCTCTGAACACATTGAGTT GGAATCACTGTTTAGAACACACACA 

f 5:? CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

AAAATAAAAACT CT TATAAAT TT C T ATTT T TAT C TGAGT T ACAGAAATGAT TAC TAAGGAAGAT T 
AC T CAGTAAT T T GT T T AAAAAGT AATAAAATTCAACAAACATT T GCT GAATAGC TACTAT AT GT C 
AAGTGCTG TGCAAGGT AT TACAC TCTGTAATTGAAT AT TAT TCCTCAAAAAATT GCACAT AGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTT TAAAAGT GCC 
AT T AACAAATGT AT CACTAGCCCTCCTT T T TCCAACAAGAAGGGACT GAGAGATGCAGAAATAT T 
T G TGACAAAAAAT TAAAGCAT T TAGAAAAC T T 
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FIGURE 6 



MARCFS LVLLLTS I WTTRLLVQGS LRAEEL S I QVSGRIMG I TLVSKKAN QQLN FTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGOTGDGFVVT SRI SPNPKCGKNGVGVLI WKVPVSRQFAAYCYN 
SSDTWTNSCI PEI I TTKDPI FNTQTATQTTE FIVSDST YSVASPYST I PAPTTTPPAPASTSI PR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-rayristoylation site. 

amino acids 23-29, 65-71, 234-240/ 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCGCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 

GGAGGACACGCAGCACAAAT TGCGCA.GCGCGG TGGAAGAGATGGAGGCAGAAGAAGCTGC T GCT A 

AAGCATCATCAGAAG T GAACC TGGCAAACT TACCTCCCAGCTATCACAAT GAGACCAACACAGAC 

ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 

TGGACAAATGGTC T T T TCAGAGACAGTTATCACATC TGT GGGAGACGAAGAAGGCAGAAGGAGC C 

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGG 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCC T GC TGCCCAGAGAGGTCCCCGATGAGTAT GAAGTTGGCAGCT T CATGGAGGAGGT GCG 

CCAGGAGC TGGAGGACCTGGAGAGGAGCC TGACTGAAGAGATGGCGC TGGGGGAGCC T GC GGCT G 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 

CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 

T T T T C T T GAGG C AT G C ACAT C T G G AAT TAAGG T C AAAC TAAT T C TCACAT C CCT C TAAAAG T AAA 

CTAC TGT TAGGAACAGCAGTG T T CTCACAG TGTGGGGCAGC CGTCC T T C TAATGAAGACAAT GAT 

ATT GACAC TG TC CCTCTT TGGCAGTT G CAT TAGTAACTT T GAAAGGTATAT GAC T GAGCGTAGCA 

TAC AGGT TAACCTGCAGAAACAGT ACT TAGGTAAT T GT AGGGCGAGGAT TATAAATGAAAT T T GC 

AAAATCAC T TAGCAGCAAC TGAAGACAAT TAT CAACCACGTGGAGAAAATCAAACCGAGCAGGGC 

T GT GT GAAACAT GGT T GTAAT AT GC GACTGCGAACACT GAACTCTAC GC CAC TCCACAAAT GAT G 

T T T T CAGG TG T CAT GGACTGT T G CCACCAT GTAT TCAT CCAGAGT T CT TAAAGT T TAAAGT TGCA 

CAT GATTG TATAAGCATGC TT TC T T T GAGT TT TAAATT AT G TATAAACATAAGT T GCAT TTAGAA 

ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS S EWLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECI IDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 



amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 




N-glycosylation site. 



amino acids 96-100, 106-110, 121-125, 204-208 



7 / 



Casein kinase II phosphorylation site. 




327-331 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 



amino acids 140-144 
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FIGURE Q 

CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCA^TAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGGCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGC TCAGAGGAG T GCAGAACCCTGTAGCCCGCT GCATCAT GT GCTGTT T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
T TGAGAGGCT GAGGCGGGCGGAT CACCT GAGT CAGGAGT T CGAGACCAGCC TGGCCAACAT GGT G 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATTTTATTAflAGATATTTTGTTAACTC 
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FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLWSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWIiALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYI DHKLRGVQNPVARC IMCCFKCCLWCLEKFIKFLNRNAYIMIAI YGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBRO 02c family proteins. 

amino acids 27 6-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCAXCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGT CCAGGACGC CCAGCCCAACTCGGGTC TGCT GCAGGCCT CGGTCATCACCCTCT ACACCAT G 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAAGTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLL.GYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYI PDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLAS LHVMMTLTNWYKPGETRKMI S TWTAVWVKI CAS WAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GG T TGGAAAAAGACTCC T GTAAC CC TCCTCCAGG ATGA ACCACCT GC CAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGC CAGGAT TGAGTCCTATGAAG GAAGGGAAAAGAAAGGCATATCT GATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCATT GAGAACACAT TAGAGAAGGAGGT GAT GCAGTAT GACT AC TAT TC T T CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACT GCGCCATTGGT GGGCAATAGCGT T GACAACGGCAGTGACCAGTGCCT T T T TAC TAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
rfi C TT GCC TGGAT TGAGACGT GGT T CCT GGAT TT CAAAG T GT TACC TCAAGAAGCAGAAGAAGAAAA 

y| CAGACTCC TGATAGT TCAGGATGCT TCAGAGAGGGCAGCAC T TATACC TGGTGGTCT T TC TGAT G 

GTCAGT T T TAT T CCCCTCCTGAATCCGAAGCAGGAT CTGAAGAAGCT GAAGAAAAACAGGACAGT 

% \ 

2 GAGAAACCACTT TTAGAACTATGAGTACTACTT TTGT TAAATGTGAAAAACCCTCACAGAAAGTC 

hi X 

J" ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 

C5 CACT GC TGGC T T TAT T GAACAGCTAAT AAAGAT T TATTTAT TG TAAT ACC T CACAAACGT TGTAC 

CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 

AAAAGAC T TAATATATTGAAGTAACACT T T TT TAGT AAGC AAGATAC CTT T T TAT T TCAATT CAC 

W 

fi\ AGAATGGAAT T TT TT TG TT TCAT GT CTCAGAT T TAT T T TGTATT TC T T T T T TAACACTCT ACAT T 

TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 



; s \ 

,S 



ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTT GCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAAT GTCACCAGACAT T TGTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT T AAAC CAT TCAT AT CATGT T TCC T TTGCGT TCAGCCAAT T TCAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNI 

VTLLWI IELNVNGGIENTLEKEVMQYDYYSSYFDI FLLAVFR FKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLLAECVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 0 

Transmembrane domains : 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 

CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 

GGTCCCTCTGCT6CTGCCGCTGCTCCTGCTACT6GCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 

CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 

CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 

AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 

TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 

CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 

CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCAGCGTCCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGGCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

T CCAGGCGGACCCT GGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 

T TTAGGAACATGTT TTGCTTTTT TAAAATATATATAT T TATAAGAGATC C T T TCCCAT T T AT TC T 

GGGAAGATGT TT TT CAAAC T CAGAGACAAGGACT TT GGTT T T T GTAAGACAAAC GAT GATAT GAA 

GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGIiYVFENGIT 
MLDAGS FAGLPGLQLLDLSQNQIAS LPSGVFQPLANLSNLDLTANRLHE I TNE TFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRRLPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLPEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAAROT 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLE LE GVKVPLE PGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQS PLHAKP YI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites* 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAAT C C T TAGAT TCCAAGACTAC T T T GACAT CAGATGAGT CAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGT TGC TGGTCAAATATTTCTTGAT TCAGAAGAATCTGA 
AT T AGAATCC TCTAT T CAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCT T T C T AGAGT CT CCAAAT CCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
P k TTT C C TAGAT AAGGAGTAT GAT GAAT GTACATCAGATGGGAGGGAAGATGGCAGACTGTGG T GT G 

' I 

2? C TACAACC TATGACTACAAAGCAGAT GAAAAG TGGGGCT T T TG TGAAAC T GAAGAAGAGGCT GC T 

» » 

AAGAGACGGCAGAT GCAGGAAGC AGAAATGATGTAT CAAAC TGGAAT GAAAATCCTT AAT GGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATC GG TAT CT CCAAAAGGCAGCAAGCAT GAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
. GCAGCGAGAGAGAT GT T TGAGAAGC TGACT GAGGAAGGCT C TCCCAAGGGACAGAC T GCT CTTGG 



" ss * CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 



i*t TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
1 11 AAT AAT AT TAACAT CAGAAGAAT T TGTGGTT TATAGCGGC CACAAC T T T T T CAGC TT T CATGAT C 

Wf CAGAT TTGCT TGTAT T AAGACCAAATATT CAG T TGAAC T T CC TTCAAATTC TTGT TAAT GGATAT 

*M AACACAT GGAAT CT ACATGTAAATGAAAGT TGGTGGAGTCCACAAT T T TTCT T TAAAATGAT TAG 

T T TGGCTGAT TGCC CCTAAAAAGAGAGAT C TGATAAAT GGCTCTTTT TAAAT T T T CT CT GAGT T G 

Hi 

GAAT TGTCAGAATC AT TT T T TACAT TAGAT TAT CAT AAT T T TAAAAAT TTT TCT T TAG T TT TTCA 
AAAT T TTGTAAATGGT GGC TATAGAAAAACAACATGAAATATTATACAATAT T T TGCAACAATGC 
CCTAAGAAT TG T TAAAAT TCATGGAG T TAT TTGTGCAGAATGACTCCAGAGAGC T C TACT TT CT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACT CAT T T TTAATAAAAT TATGTCTAAGATT AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSIASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
E S S I QEEEDSLKSQEGE SVTEDI S FLE S PNPENKDYEEPKKVRKPALTAIEGTAHGE PCHFFFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMyrYQTGMK 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 



N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 




N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 



Tyrosine kinase phosphorylation site. 



amino acids 220-228 
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FIGURE iq 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTAT TATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTT TTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT TT TT AT CAC TGGATGTGACTCGGGC T T TGGAAACT TGGCAGCCAGAAC T T T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCT CAGAGAGAC T TCGTAC T GTGCTT CT GGAT GT GACCGAC CCAGAGAATGTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TAT GCAGTGGAAGGT T TCAATGACAGCT TAAGACGGGACATGAAAGCT T T T GGTGTGCAC G T CT C 
ATGCAT TGAACCAGGAT TG T TCAAAACAAACT TGGCAGAT CCAGTAAAGG T AAT TGAAAAAAAAC 
T CGCCATT TGGGAGCAGC TGTC TCCAGACATCAAACAACAATAT GGAGAAGG T TACAT TGAAAAA 
AGTCTAGACAAACT GAAAGGCAATAAATCC TATGTGAAGATGGACCT CTCT CCGGTGGTAGAGT G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
G CAGAGC T GGC T AAT C CCAAGGC AGT GTGA CT CAG C TAAC CAC AAAT G T C T CC T C CAG GC TAT G A 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CAT T TAGAT C G T GC T TAT T T GGAT T GCAAAAGGGAGT C CC ACCAT C G C T G G TGG TAT C CC AG GGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATC T C TTAT CT AAATAT TAAAAGATAAGT CAACCC AAAAAAAAAAAAAAA&AAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLF^VLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPEOTKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLNMLPLVKKAQGRVIWSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVWMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELMPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site, 
amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TACTGAT TCCCAAATGGATGATGT TGAAGT T GT T TATACAAT TGA 
CAT T CAGAAATATAT T C CATG C TAT CAG C T T T T TAG C T T T T AT AAT T C T TCAGG CGAAG T AAAT G 
AGCAAGCACTGAAGAAAATAT TATC AAAT GTCAAAAAGAATGT GGTAGGT T GGTACAAATTCCGT 
CGT CATT CAGATCAGAT CATGACGT TT AGAGAGAGGC T GC TTCACAAAAACTTGCAGGAGCAT T T 
T TCAAACCAAGACCT TGT TT TTCTGCTAT TAACACC AAG T ATAATAACAGAAAGCTGCT CTACT C 
M AT C GAC T G G AAC AT T C C T T AT AT AAAC C T C AAAAAG GA C t T T T T C AC AG G G T AC C T T TAG T G G T T 



r 5 ? 



; s : 



GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TTTAGCCGAGCAGTACAAACACACAGC T C TAAAT T T T T T GAAGAAGAT GGATCC T TAAAGGAGG 
T ACATAAGATAAAT GAAATGTAT GCT T CAT TACAAGAGGAAT TAAAGAGTATATGCAAAAAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCT TAATGGTAGAACACAC TGACATTCCT GAAGCTAGT CCAGC TAGTACACCAC 
AAAT CAT TAAGCAT AAAGCCT TAGAC T TAGATGACAGAT GGCAAT T CAAGAGAT C TCGG T TGT TA 
GATACA.CAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGAT GAAGGGT T T TGGT GAATAT TCACGG T CT C 
CTACATT T TGAT CCT T T TAACCT TACAAGGAGAT t T T T T TAT T TGGCTGAT GGG TAAAGCCAAAC 

ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACT CT TAGTGCAT T TACT TCACAAAGT ACT TT TTCAAACATCA 
GAT GC TT T TAT T TC CAAACCT TT TT T T CACCT T TCACT AAGT TGT TGAGGGGAAGGC T TACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGT CAGGAGGAT TGAT TGAAGCTAGGAGT TAGAGACCAGC C TGGGCAACG TAT T 
GAGACCATGTCT AT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T T TCAAAATAT GGAAA 
GAASlTTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CATTATGGCTAGAGTT GCCAGAT AAAATGC TGGATATC AT GCAATAAAT T T GCAAAACAT CATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNWKNWGWYK^ 

VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEBCRRGA 
Q I QAAREKN I QKDPQENI FLCQALRT FFPNS E FLHS CVMS LKNRHVS KS S CNYNHHLDWDNLT L 

MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 



Important features: 
Signal peptide: 



amino acids 1-19 



N-glycosylation sites. 



amino acids 75-79, 322-326 



N-myristoylation site. 




amino acids 184-154 



Growth factor and cytokines receptors family. 



amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 

GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 

GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 

GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 

CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 

TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 

GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 

AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 

AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 

CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGTAAAGAT GATT TCAAAAGT GT GGTCAGCGAACAGTGCAATCAT T TG 

CAAGCTGT CTTTGCTT CACGTT ACAAGAAGTTTGAT GAATT CT T CAAAGAACTACTTGAAAATGCAGAGAAAT C C CTG 

AATGATAT GTTT GTGAAGACAT ATGGCCAT TTATACATGCAAAATT CT GAGCTAT TTAAAGATCT CTT C GTAGAGT T G 

AAACGTTACTACGTGGTGGGAAATGT GAACCTGGAAGAAAT GCTAAAT GACTT CT GGGCTC GCCT CCT GGAGCGGATG 

TT CCGCCT GGTGAACT CCCAGTACCACTTT ACAGAT GAGT ATCT GGAAT GT GT GAGCAAGTATACGGAGCAGCTGAAG 

CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 

AT GAT CTACTGCTCC CACTGCCGGGGTCT CGTGACT GTGAAGCCAT GTTACAACTACT GCT CAAACAT CAT GAGAGGC 

T GTTT GGC CAACCAAGGGGATCT CGATTTT GAAT G GAACAATT T CATAGAT GCT ATGCTGAT GGT G GCAGAGAGG CTA 

GAGGGTCCTTTCAACATTGAAT C GGT CAT GGAT CCCATC GATGT GAAGAT TTCT GATGCTATTAT GAACAT GCAGGAT 

AATAGTGT TCAAGT GT CTCAGAAGGTTTT CCAGGGAT GT GGACCCC CCAAGCC C CTCC CAGCTGGAC GAATT TCT CGT 

TCCAT CTCT GAAAGT GCCTTCAGTGCTC GCTTCAGACCACATCACC CCGAGGAACGCCCAACCACAGCAGCT GGCACT 

AGTTT GGACCGACTG GTTACT GATGT CAAGGAGAAACTGAAAC AGGCCAAGAAATTCT GGT CCTCC CTTCCGAGCAAC 

GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATG 

CT GTT TGC^GTGACAGGAAAT GGATT AGCCAACCAGGGCAACAACC CA.GAGGT CCAGGTT GACACCAGC^ 

ATACT GAT CCTTCGT CAAAT CAT GGCTCTT CGAGTGATGACCAGCAAGAT GAAGAAT GCAT ACAAT GGGAACGAC GT G 

GACTT CTTT GATATCAGTGATGAAAGTAGT GGAGAAGGAAGTGGAAGT GGCTGT GAGTATCAGCAGTGC CCTT CAGAG 

TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 

CAGG CCTACCTC CTCACT GT CT T CTGCATCTT GTTCCTG GTTAT GCAGAGAGAGT GGAG ATAA TT CTCAAACT CT GAG 

AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATC^CTTTTCTACCATCCTAGTGACTTTGCTTTTTA 

TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 

AGGAAAAGGGACT GT GCATT GAGTTGGTTCCTGCTC C CCCAAACCAT GTT AAAC GTGGCTAACAGT GT AGGT ACAGAA 

CT ATAGTT AGTT GTGCATTT GT GATTTTAT CACT CTATTATTT GT TT GTAT GTTTTT TT CT CATTTC GT TT GT GG GTT 

TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 

T CTGAAATATTAAAT AGCT GTACAGAAGCAGGTTTT ATTTATCAT GTTAT C TTAT TAAAAGAAAAAGC C CAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSSQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFTOLKRYYWGNWLEEMLNDFWARLLEBHFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
E KLKQAKKFW S S LP S NVCNDERMAAGNGNE DDCWNGKGKSRYL FAVT GNGLANQGNNPEVQVDT S 
KPDI L I LRQ IMALRVMTS KMKNAYNGNDVDFFD I S DESSGEGS GSGCEYQQCPSE FDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 

Important features : 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2S 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC TTC TAGGAGATGGC TCCAGGAAGGCGGCCAAGAAT G T GAG TGCAAAGAT T GGT TC C T GAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGT GAAGAAAACAAGACACCAAAGGCACCAC AGAAAGCGAAACAAGCAT TCCAGA 
GCC TGCCAGCAAT T TCT CAAACAAT G T CAGCTAAGAAGC T T TGCTCT GCCT T TG TAGGAGC TCT G 



U AGC GCCCACTC T TCCAAT TAAACAT T CT CAGCCAAGAAGACAGT GAGCACACCTACCAGACACT C 



TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
\*l AGC TAGTGTCAT T TAACCTTAAATGCAATCAGGAAAGT AGCAAACAGAAGT CAATAAATAT TT T T 

-A AAATGTCAAAAAAAAAAAAAAAAAA 

f i 

» »■ 

' r ",i :r 
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FIGURE 26 
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MKVLISSLLLLLPmmSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRK™ 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

G G ACGC CAGCG C C T GC AGAGG C T G AGC AGG GAAAAAG C C AG T GC C CCAG CGGAAGC ACAG C T CAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
T CT TT TT C TGGGAGCATGTGGCAGAAC CATATGGAAGCTGGGCC T TCATG T GGCAGCAAGT T T TC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACAT TCAT GTACCACCTAC TAGTCCC TC TC TCCCCAACC TCT GCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAA?^AGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGC AGAGG TT TTGT TCTCAAATAT TTTT TAATAAATAGACGAA 
ACCACG 



AppJD- 10063 579 



Page 179 of 320 



FIGURE 28 

MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSKRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQIJUXdSMDVWCTLVLCSVQSPRKVXQEVRRVLRPGGVLFFWEHVAE 

KH I GDGCCLTRETWKDLENAQFSE I QMERQPPPLKWLPVGPH IMGKAVKQS FPS SKAL ICS FPSL 
QLE QATHQ P I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATGC TGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTTTAT GAT TCTTATACTAATTTATACAAAGATAT TAAGGCCCTGT TCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
C CT CCAAGCAAGTCAT TTCCCT TAT T TAACCGATG TG TCCC TCAAACACC T GAG TGC TACTC CCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCGATGATGTTTACC 
T TCAGAT TCATCAC CACCC TTC T GGT T CACAT T T TCAT T T CATTGGT TAT T T TGGGAT TGT T GT T 
TGTCTGCGGTGTTT TATGGTGGCTGTAT TATGAC TATACCAACGACC TCAGCATAGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
I"* AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 

» •* 

% TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 

i l 

?Vi ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 

{ » r 

y j AAT TGGCC TCAT CT GGACTAG T GAAT TCATCC T T GC G T GC CAGCAAA.TGAC TATAGCTGGGGCAG 

y] TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 

■ * * 

^ GATT CCGAGAAT CAT T GTCAT GTACAT GCAAAAC GCACTGAAAGAACAGCAGCATGGTGCAT TGT 

p'i CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

\f\ AACCAGAATGCATATAC TACAACT GCTAT TAATGGGACAGAT T TC T GTACAT CAGGAAAAGATGC 

£| ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 

* T T T T T C TAGGAAAGG TGT TAG T GGTG TG T TTCAC TGT TT T TGGAGGAC TCATGGC TT TTAAC TAC 

■ I 

AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 

v f - 

\ -5 if 

CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGATCGTCAGAAAAGCCC TACT T TAT GGAT CAAGAAT T TC TGAGTT TC GT A 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACT CAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACC CATTTAGGTAT CTGTACC TGGAAAACAT T 
T CCT TC TAAGAGCCATT TACAGAAT AGAAGAT GAGACCACTAGAGAAAAG T TAG T GAAT T TT TT T 
T TAAAAGACC TAATAAACCCTATTC T T CC TCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAIYM^ 

LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSA 

I L I FFW VL WAVLLS LGTAGAAQVME GGQVE YKPLS GIRYMWS YHL IGLIWTSEFI LACQQMT I A 

GAWTCYFNRSKNDPPDHP ILS S LS I LFFYHQGT WKGS FL I S WRI PRI I VM YMQNALKEQQHG 

ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 

FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 

VDLETNDGS SEKPYFMDQE FL S FVKRSNKLNNARAQQDKHS LRNEEGTELQAI VR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT T TGTAT GGGAT TTGTGAT GCAGGAAAGCC TAAGGGAAAAAGAATAT TCAT TCT GT GT GGT 
GAAAAT T T TTT GAAAAAAAAAT T GC C T TCT TCAAACAAGGG TG TCAT T CTGATAT T TATGAGGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
T ACACAGT GG T GT GCT T GAT AAT T CAGGAGGGAAAATACT TGT TCGGAAGGT TGCTGGACAGTGT 
GGTTACAAAGGGAGT TAT TCCAACGG T GT C CAATCGT TAT CCC TACCACGATGGAGAGAAT CCTT 
TATCGTCT TAGAAAG T AAACCCAAAAAGGG TGTAAC CTACCCATCAGCTCT TACATAC TCAT CAT 
C GAAAAGT CCAGCTGCCCAAGCAGGT GAGACCACAAAAGCCTATCAGAGGC CACCTAT TCCAGGG 
y. ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
%l C AC C T TGC CAAG G C CAT CCCCTTCTGCTGCTTC T AC C ACC AGCAT CC CC AGACCACAAT C AG TG G 

y GCCACAGGAGCCAGGAGATGGAT CT CT GGT CCAC TGCCAC CTACACAAGCAGCCAAAACAGGCC C 

□ AGAGC TGATCCAGGTATCCAAAGGCAAGAT CC T T CAGGAGCTGCC T T CCAGAAACCTGT T GGAGC 

01 GGAT GTCAGC C T GG GAC T T G T T C CAAAAGAAGAAT T GAG CACACAGT C T T T G GAGC CAG T AT CCC 

y I TGGGAGATCCAAAC TGCAAAAT T GACT T GTCG T T TT TAAT TGATGGGAGCACCAGCAT T GGCAAA 

% CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
f\ C GGT C C AC T GAT GG G TGT T GTC CAG TAT GGAGAC AACC C TGC TAC T C AC T T TAACC T C AAGACAC 

: n ACACGAATTCTCGAGATC TGAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACTT T CT AAT 

Ul GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
a GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
% % GAC TTGCGAGAGAGT CAGGAAT CAACAT T T TC T TCAT CAC CAT TGAAGGTGCTGC TGAAAAT GAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
W GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
Q ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
jjj GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
%l AGAGT TTGAGATT T CCGACACGGACACGCGCAT CGGGGCC GTGCAGTACAC CTACGAACAGC GGC 

^ TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
FM TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GTCCAAGCCCAACAAGAGGAAGT TAAT GATCC TCAT CACC GAC GGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCAT TGCCACTCACCCCGCCAGAGACCACTCCTTCT TTGTGGACGA 
GT TT GACAACCT CCAT CAGTATGTC CC CAGGATCAT CCAGAACAT T TG TAC AGAGTT CAAC T CAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GTCT TGT TAT TATTC TT TGCCAT CATGC TTT T TCATATTCCAAAACT T GGAGT TACAAAGA 
T GAT CACAAACGTATAGAATGAGCCAAAAGGCTACAT CAT G TT GAGGG TGCTGGAGAT TT TACAT 
T TTGACAATTG T T T T CAAAATAAAT G T TCGGAATACAGTGCAGCCCT TACGACAGGC T TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T TT GTCATGACAATGTAGGAAT TGCT GAAT TAAAT G T T TAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAZWVAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS SVCC3AAVHS GVLDNSGGKI LVRKVAGQSGYKGS YSNGYQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPA^QAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RS GAPNWWMVDGWPTDKVEEASRLARE SGINIFFITIE GAAENE KQYWE PNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 



lf «i FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 
0] VDE FDNLHQYVPRI IQNICTEFNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 



AppJD=l 0063 579 



Page 186 of 320 



FIGURE 3 5 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTG GATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCGCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTT T C ACTT C AC ACT TC GAGAGC AT T C AAACT GCTCTCATCAAAATCCATT TC TGGTCATTC 
U s TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
Q TGGT GGGG AT AT GAGGT T CTT ACATT TT TCTT AT TAGGCC AAGAGGCT GAAAAGGAAG ACAAAATGTTGGC 

*-•«►' 

Q ATT GTCCTT AGAGGAT GAAC ACCT TCTTT ATGGT GACAT AAT CCGACAAGAT TTTT T AGAGAC AT ATAAT A 

■SH ACCT GACCTTGAAAAC CAT T ATGGCATT C AGGTG GG TAACT G AGTT TT GCCC C AAT GCCAAGT ACGTAAT G 

ill 

AAGACAGAC ACT GATGTT T T C ATCAAT ACTGGC AAT T T AGT G AAGT AT CT T TT AAACCTAAACC ACTCAGA 
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 

\l 

CTT ACCAGGAGT AT CCTT T C AAGGTGTT CC CT CC AT ACTGC AGTGGGT TGGGT T AT AT AATGT CC AGAGAT 

l ) t 

T TGG TGCC AAGG AT CT AT GAAATGATGGGTCACGTAAAAC CC AT C AAGTT TGAAG AT GTT TAT GTCGGGAT 

'fsn CTGT TT GAATTT AT TAAAAGTGAACAT T C ATATTCCAGAAGACACAAATC TTT T CTTTCT AT AT AG AATCC 

hats 

j|| ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 

Q TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

yj GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 

O AC AC TGAACT GAAACT CAT GAAAAACCC AG ACTGGAGACT GGAGGGTT AC ACTTGT GATT TAT T AGTC AGG 

I U CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 

ACCAAAC AAT TTGGAC AT GTCATT CT GT AGACT AGAAT TT CT T AAAAGGGTGT T ACTGAGTT AT AAGCTC A 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTT ATGT GGAT T ACCAATT TAAAAAT ATATGTAGT TCTGTGTC AAAAAACT TCTTCACT GAAGTT AT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT TAT T T AAAATT ACT TCAACT T T GT GT T TTT AAAT GT TT T G ACGATT TC AAT AC AAG AT AAAAAGGATAG 
T GAATC ATTCTTT ACATGC AAACATT TT CC AGT T AC TT AACT GATC AGTT TATT AT T G AT ACATC ACTCCA 
TT AATGT AAAGTCATAGGT C ATTATT GCAT AT CAGT AATCT C TTGGAC TT T GTT AAAT AT TT T ACTGTGGT 
AAT ATAGAGAAG AATT AAAGC AAGAAAAT C TG AAAA 



App_ID=10063579 



Page 187 of 320 



FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSL 

T LREHSNC S HQNP FLVILVT SHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKE DKMLA 

LSLEDEHLLYGDI IRQDFLDTYNNLTLKT IMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 

NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 

KPIKFEDVWGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAMGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 



Type II transmembrane domain: 



amino acids 20-39 



N-glycosylation sites. 

amino acids 72-76, 154-158/ 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 



amino acids 239-243 




Ly-6 / u-PAR domain proteins. 

amino acids 23-37 



N-myristoylation site. 



amino acids 271-277 
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FIGURE 37 
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1 11 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCA6CTCCTTCTCATC 
TCG TC CTT GCCAAGAGAGT ACACAGTCAT TAATGAAGCCTGCCC T GGAGCAGAGTGGAATATCATGTGTCGGGAGTGC TGTG 
AATAT GAT CAGATTGAGTGCGTC T GCCCCGGAAAGAGGGAAGTCGTGGGTT ATACC ATCCC TTGCTGCAGGAATGAGGAGAA 
TGAG T GT GACTC CTGCC TGATCC ACCCAGGTTGTACCATCTT TGAAAAC TGC AAGAGC TGC CGAAATGGCT CATGGGGGGGT 
ACC T T GGAT GAC TTC TATGTGAAGGGGT TCTAC TG TGCAGAGTGCCGAGCAGGC TGGTACGGAGGAGAC TGCATGCGAT GTG 
GCCAGGT T CTGC GAGC C CCAAAGGGTCAGATTTTGT TGGAAAGCTAT CCCC TAAATGC TCACTGTGAATGGAC CATTCAT GC 
TAAACCT GGGT T T GTCATCCAAC T AAGAT T TGTCATGTT GAGTC TGGAGTT T GAC TACAT GTGCCAGTATGAC TATGT T GAG 
GTTC GTGATGGAGACAACCGC GATGGCCAGATCAT CAAGCGTGTC TGTGGCAACGAGCGGC CAGCTCCTATCCAGAGCAT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATC CCC T T GT TT CCATGACGGCACGT GCGTCCTT GACAAGGC T GGAT C T TAC AAGTGTGCCTGC TTGGCAGGC 
TATAC TGGGCAGCGC T GTGAAAATCTCC T TGAAGAAAGAAACTGCTC AGACC C TGGGGGCC CAGTCAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAAGCCTGCCGA 
GAACCAAAGATT TCAGACC T GGTGAGAAGGAGAGTTC T TC CGATGCAGGTT C AGTC AAGGGAGACACCATTACACCAGC TAT 
ACTCAGCGGCC T T CAGC AAGC AGAAAC TGCAGAGTGCCCC TACCAAGAAGCCAGCC C TTCCCTTTGGAGATCTGCCCATGGG 
ATACCAAC ATC TGC AT ACC C AGC TCCAG T ATGAGTGCATC TCACCCT TC TACCGC CGCCTGGGCAGCAGCAGGAGGACATGT 



? C TGAGGAC TGGGAAGTGGAGTGGGCGGGCACCATC CTGCAT CCCTATC T GCGGGAAAATTGAGAACATCAC TGCTCCAAAGA 



CCCAAGGGTTGC GCTGGCC GTGGCAGG^AGCCATCTACAGGAGGACC AGCGGGGTGC ATGACGGCAGCC TACAC AAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCAT GATC AAGAC AGCAGAC C TGAAAGTTGTTTTGGGGAAATT C TACCGGGAT GATGACCGGGATGAGAAGACCATCC 
AGAGC CTACAGATT T C TGC T ATC AT TC TGCATCCCAACTAT GACCCCATCC TGC T TGATGC TGACATCGCCATCC T GAAGC T 
CCTAGACAAGGC C CGT ATC AGCAC CCGAGTCCAGCCCATC TGC C TCGC TGCC AGTCGGGATCTCAGCAC TTC CTTCCAGGAG 
*** TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
^ f TGGTCAGTGTGGTGGAC TCGC TGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 

W CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 

CGAGCATCT CCTGAGCC ACGC TGGCATC TGATGGGACTGGTCAGC TGGAGC TATGATAAAACATGCAGCCACAGGCT C TC CA 
CTGCC TTCACCAAGGTGCTGCCT T T TAAAGACTGGATTGAAAGAAAT ATGAA ATG& ACCAT GC TCATGCACTCCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGG G C TTC TGAC TTC AGGGACAAAACTCAGTGAAG G G TGAGT AGACC TC CAT TGCT GG TAGGCTGATGCC GC GTC C A 
CTACTAGGACAGC CAAT TGGAAGATGCCAGGGCTTGCAAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAAC C T CTC C A 
CTC CACTGACC TGGT GGTC T TC C C C AAC T T TCAGTTATAC GAAT GCC AT CAGC TTGACCAGGGAAGATC TGGGCT TCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKI TGGPGLINGRHAKI GTWS FFCNNS YVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAI YRRTS GVHDGS LHKGAW FL VCS GALWERTVWAAHCVT DLGKVTMI KT ADL 
KWLGKFYRDDDRDEKT I QSLQI S AI I LHPN YDP I LLDAD I AI LKLLDKARI S TRVQP ICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myr i s toylat i on s i te s . 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTC CTACAT CCTCT CATCTGAGAATCAGAGAGCAT AAT CTT CTTAC GGGCCCGT GATTT AT TAAC GT GGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGC CCTTGTAGCTGACAGAAGGTGGCCAGGGAGAAT GCAGCACACT GCTCGGAGAATGAAGGC GCTT CTGT T G C 

TGGTCTT GCCTT GGCT CAGTCCT GCTAACT ACATT GACAAT GT GGGCAAC CTGC ACT T CC T GTATTCAGAACTCT GTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGC CACGGCTCCCT CCCCAGAGGTTT CT GCAGCTGC CAC CATCT CCTTAAT GACAGAC GAGCCT GGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCT TT GAGAGATC C ACT ATTAGAAGCAGATC ATTT AAAAAAATAAAT C GAGCTT TGAGT GT T CTTC GAA 
GGACAAAGAGCGGGAGTGC AGTT GC CAACC ATGCC GAC GAG GGCAGGGAAAATT CTGAAAACACCACT GCC CCTGAAG 
T CT TT CCAAGGTTGT ACCACCT GATT CCAGATGGTGAAATT ACCAG CATCAAGAT CAATCGAGTAGAT CCCAGT GAAA 
GC CTCT CT AT TAGGCT G GT GG GAGGTAGCGAAACCCCACT GGT C CATATCAT TAT CCAACACATTTAT CGT GATGGGG 
T GATC GCC AGAGACGGCCGGCTACTGCCAG GAGACAT CAT T CT AAAGGTCAAC GGGAT GGACAT CAGCAATGTCC CTC 
ACAACTACGCTGT GCGT CTCCT GCGGCAGC CCT GCCAGGT GCT GT GGCTGACTGT GAT GCGT GAACAGAAGTTCCGCA 
Ll J GCAGGAACAATGGACAGGCCCCGGAT GCCT ACAGACC CCGAGATGACAGCT TTCAT GT GAT T CT CAACAAAAGTAGCC 

yi CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
^ J GT GTGGCAT ATC GACAT GGTCAG CTT GAGGAGAAT GACCGT GT GT TAGCCATCAATGGACATGAT CTT CGATATGGCA 



> * r 



Hi? 



GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGC^ 

CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTT GCCTATCTATGT CATCAGTGTT GAGCCCGGAGGAG 
T CAT AAG CAGAGATG GAAGAATAAAAAC AG GT GACAT T TT GTT GAAT GTGGATGGGGT CGAACTGACAGAGGT CAGCC 
GGAGT GAGGCAGTGGCATTATT GAAAAGAACAT CAT C CTC GATAGTACTCAAAGCTTT GGAAGTCAAAGAGTATGAG C 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
f|j GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 

TGGGCTT CTGCATTGTAGGAGGTTAT GAAGAAT ACAATGGAAACAAACCTT TTTT CAT CAAATC CATT GTT GAAG GAA 
CACCAGCAT ACAATGATGGAAGAATT AGAT GTGGT GATATT CT TCTT GCT GTCAATGGTAGAAGT ACAT CAGGAAT GA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TTTTATAGAAT CAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAAT AG GCTAAGAAGTT GAAACACTATATTT ATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAA^ 

ACACCTCAGAAAATAT GATT CCAAAAAAATTAAAACTACT AGTTT TT TTT CAGT GT GGAGGATTT CTC ATTACTCTAC 
AACATTGT TTAT ATTT TTTCTATTC^VTAAAAAGCCCTAAAACAACT AAAATGAT T GATTT GTAT AC C C CACTGAATT 
CAAGCT GATTTAAATT TAAAAT TTGGTATAT GCTGAAGTCT GCCAAGGGT ACATT AT GGCCATTT TTAATTT ACAGCT 
AAAAT ATT TTTT AAAATGCATT GCTGAGAAACGTTGCTTT CAT CAAACAAGAATAAATATT T TT CAGAAGTT AAA 
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FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT IS LMTDEPGLDNPAYVSSAEDGQPAI S PVDS GRSNRTRARP FERSTI RSRS FKKI NR 
ALSVLRRTKS GSAVANHADQGRENSENTTAPEYFPRLYHL I PDGE I TS IKINRVDPS E SLS IRLV 
GGSETPLVHI I IQHI YRDGVIARDGRLLPGDI ILKVNGMDI SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDI FQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KTGD I LLNVDGVE LTEVSRSEAVALLKRTS S S I VLKALEVKEYE PQEDCSS PAALDSNHISIMAPP 
H SDWSPSWWWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 

V > 

W RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

i i : 

I E | Important features : 

1)1 Signal peptide: 

ammo acxds 1-15 

(s*s ' N-glycosylation sites. 

HI amino acids 108-112, 157-161, 289-293, 384-388 

>■ 

* ■ ? 

hi 

f ? Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-rnyristoylation sites, 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 
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ACCAGGCATTGTATCTTGkGTTGTC^TCA^ 

C TT GCCT GCAGTGAAGCAGAGAGATAGATATTATTC^CGTAATAAAAAACATGGGCTTCAACC T GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCAT TTT GGGGAAGGGAAAAACTCTGAC TAATGAAGCAT CCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTT GGAAGAGGT ACAG GCAGAAAATCCC AAAGT GT C C AGAGGCC GGT AT C GC CCT CAGGAATG T AAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGT TTAATCGAGCCAAACT CTTGAATGT GGGC TATCTAGAAGCCCT CAAGGAAGAAAAT TGGG 
AC T G CT T T ATAT T C C AC GAT GT G GACCT GGT AC C CGAGAAT GACT T T AAC C T T TACAAG T GT GAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGT GTT ACTGCCCTAAGCAGAGAGCAGT TTTT CAAGGTGAATGGAT TCT CTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCT GAAGTGGGTAAATATAC AATGGTCTTCCACACTAGAGACAAAGGCAATGAGGT GAAC GCAGAACG 
GAT GAAGCTCT TACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
TAGTATC TGTGGAACACAAT CCTT TATATAT CAACAT CACAGTGGAT TTCTGGTT T GGT GCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTC CT GGT GAT GTAGAGTATAAAACAGTT GTAACAAGACAGCTTTCTTAGT CATTT T GAT 
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATT TATTTAAATTTGAAGTAATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACT GTTATTCAT TTAT CCTGTACAATCAT CTGT GAAGTGGTGGTGT CAGGT 
GAGAAGG CGTC CACAAAAG AGGG GAGAAAAGG C GACGAAT CAGGACACAGT GAAC T T GGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGT TT TT G TAAAAT GAT TT T G TAC AAGT AGGATAT GAAT TAG CAG TT TACAAGTT T ACAT 
ATTAACT AATAATAAATAT GTCTATCAAATACCTCT GTAGTAAAAT GTGAAAAAGCAAAA 
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FIGURE 42 



MG FNLTFHLS YKFRLLLLLTLCLT WGWAT SNYBVGAI QE I PKAKE FMANFHKT L I LGKGKT LTN 

EASTKKVSLDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 

RmEKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIF 

DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKWGFSNNYWGWGGED 

DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEWAERMKLLHQVSRWRTDGLSSCSYKLV 

SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 




N-glycosylation sites. 



amino acids 4-8, 220-224, 335-339 



Xylose isomerase proteins. 



amino acids 191-202 
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FIGURE 43 



GCTCAAGACCCAGCAGTGGGACAGCCAGAGAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAAC T T GCAGAGC TGCAAC CCCAGGACAGAGCTGGAGCCAGGGCCAGC TGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT . 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 



GT GGCT TCAT T T CAG TGGC TGACT T CCAGAGAGCAATATGGC TGGT T CCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTT CAACACAACCCCTCT TGT CACCATACAGCCAGAAGGGGGCACTAT CATAG TGAC CCAAA 
ATCG TAATAGGGAGAGAGTAGACT T CCCAGATGGAGGCTAC TCCCT GAAGCTCAGCAAAC TGAAG 
AAGAATGACTCAGGGATCTAC TAT GTGGGGATATACAGC T CATCAC T CCAGCAGCCCTCCAC CCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
u> ATAAGAATGGCACCTGTGT GACCAATCT GACAT GCT GCAT GGAACAT GGGGAAGAGGATGTGAT T 

Q TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
Q CTGGAGAT GGGGAGAAAGTGATATGACCT TCATCTGCGT T GCCAGGAACCCTGTCAGCAGAAAC T 

TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT TTCT GAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAG T GGACAT TTGTC GGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT T TACT CCAC TGT GGAAATACCGAAAAAGAT 
GGAAAATC CCCACT CAC T GCTCACGAT GCCAGACACACCAAGGC TAT TT GCCTATGAGAAT GTT A 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD ICRE TPNI CPHS GENTE YDT I PHTNRT I LKEDPANTVYS TVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 



N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 




Transmembrane domain: 



amino acids 224-250 



Leucine zipper pattern. 



amino acids 229-251 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGT TGAGGAAGAC CAATTT TC T CAAAACCCCATCTCTT GCT T TGAGT GG TGGT TCCCAGGAAT T 
ATAGGAGCAGGT CT GATGGCCATT CCAGCAACAACAAT G TCCT TGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 

AACAGTAAT GCCAATT G TGAAT T TT CAT T GAAAAAC AT CAGT GACAT TCAT CCAGAATCCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 

ACACCATGGCGAG T GGC TGGAGAGCAT C TAGT T TCCAC T T CGATTCT GAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 

CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T TAATGGGAAT AAAATGT AAGT ATCAGTAGT T T GAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTS CNGFS LLVLLLLGWLNAI PL IVSLVEEDQFS QNP ISC FEWWFPG 1 1 GAGLMAI PA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 

KNT SDIHPES FNLQW FFNDSCAPPTG FNKPT SNDTMAS G WRAS S FHFDSEENKHRL I HFS VFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites, 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 * 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, : 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 



App_ID= 10063 579 



Page 200 



FIGURE 4Q 
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ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAAC C TGCAGAGT CACACAGTTGC TGAC CAATTGAGC TGT GAGCC TGGAGCAG 
ATCCGTGGGGTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTT GC TGAGCACAGGACTGGCCTCCAGGGAT GGCCTGAAGCC TAACACT GGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLIJ\GLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAI PLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 



m 
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FIGURE 51 



GTGGAC TC T GAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC T GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGC T TT GGGCAACAGGGT C GGGGAAGCAGCCC ATGC TCT GGGAAACAC T GGGCACGAGA 
T TGGCAGACAGGCAGAAGAT G TC AT TCGACAC GGAGCAGAT GC TGTCCGCGGCTCCTGGCAGGGG 
Q) GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
ilj CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
. n GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
•T AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
Q ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 



s ! T GGCGGCAGCAGT GGT GGCAGCAG T GGCAACAGT GGTGGCAGCAGAGG T GACAGCGGCAGT GAGT 

n 



■ -I * 
. - .4 - 



CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GG T GGAGTCAAT ACTGT GAAC TC TGAGACGTC TCCT GGGAT GT TTAACT T TGACACT T TCTGGAA 
GAAT T T TAAATC CAAGC TGGG TT TC AT CAAC TGGGATGCCATAAACAAGGACCAGAGAAGC TC T C 
GCATCCCGTGACC T CCAGACAAGGAGCCACCAGAT T GGATGGGAGCCCCCACAC TCCC TC CTTAA 
AACACCAC CC T C T CAT C AC TAAT C T CAGCCC TTGCCCTT GAAATAAACC T T AGC T GCC CCAC AAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?W^AAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MKFQGPLACLLLM.CLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVTRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDWNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGI QGFRGQGVSSNMRE I S KEGNRLLGGS GDNYRGQGS S WGSGGGDAVGGVNTVNSETS PGM 
FNFDTFWKNFKSKLGFINWDAINKDQRS SRI P 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site . 

Q amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CAT CCCC TT CATCGT TT TATGCCAGCCT GACACCAT CCGGTCTATCACCAAT GCCTCAGC T GCCA 
TTGCACCCAAGGATAATCTGTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGGCCGCCTTCCATTTCAACATCCT 
7*\ GAAGTCC TATATAACGATCT T CAACAAGAGT GCAAACATCATGCT T GACAAG T GGCAGCACC TGG 

II CCT CAGAGGGCAGCAGTCGTC TGGACAT GT TT GAGCACAT CAGCC T CAT GACCT TGGACAGT C T A 

yj CAGAAATGCATCT T CAGCTT TGACAGCCAT TGTCAGGAGAGGC CCAGTGAATATAT TGCCAC CAT 

CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 



W \ CAT TGT CAGATGAGGATATAAGAGCAGAGGC TGACACCT T CAT GT TT GGAGGCCAT GACACCACG 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 



ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCT CAT CGATAT TATAGGGGTCCAT CACAACCCAACTGT GTGGCCGGAT CC TGAGGT C TACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCT GT T T TT TT GCAGAT TGT CATGAATAAAACGG T GCTGT CAAA 
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FIGURE s J. 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAOT 

L I T PTEEGLKDS TQMSAT YS QGFTVWLGP 1 1 P F I VLCHPDT I RS I TNAS AAI APKDNL FI RFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
S LMTLDS LQKCI FS FDSHCQERPSE Y I AT I LELSALVEKRS QH ILQHMDFLYYL S HDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 



LAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 



VGLQ 




Important features: 



Transmembrane domains: 



amino acids 13-32 (type II), 77-102 



Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 




N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE ss 

ATCGCATCAATTGGGAGTACCATCTTCCTCATCGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA T TCATGGCCAGTT T TAT GAAGC T T T GGAAGGCAC TAT GGACAGAAGC T GG TGGACAG TT T T 
GTAACTAT CT T CGAAACCT CT GT C T TACAGACATG T GC C T T T TAT C T TGCAGCAATGT GTT GCT T 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AA 
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FIGURE *6 



MGPVKQLKRMFE PTRLIAT IMVLLC FALTLCSAFWWHNKGLAL I FC I LQS LALTWYSLS FI PFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE B7 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCTCAGT GTCTT CAT CC CCT CC CAGGAATT CT C CACCTACCGC CAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGAC CTT GATGGGCAGCTAGACT TT GAAGAATT TGTCCAT TAT CTCCAAGATCAT GAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATATCTGAACAGCAGGCAGAAAAAATT CT CAAGAGCAT GGAT AAAAAC GGCACGAT GACCAT CGACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
AT CTTTGAT GTGGGT GAGAATCTAACG GTC CC GGAT GAGTT CACAGT GGAGGAGAGGCAGACGGGGAT GT GGTGGAGA 
CACCT GGT GGCAGGAGGTGGGGCAGGGGCC GTATC CAGAACCT GCACGGCC CC CCTG GACAGGCT CAAGGT GCTCATG 
CAGGTCCAT GCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTT CACT CAGATGATTCGAGAAGGAGGGGCCAGG 
T CACT CTGGCGGGGCAATGGCAT CAAC GTC CTCAAAAT TGCCC CCGAATCAGC CATCAAATT CAT GGC CTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT CGC CCAGAGCAGCATCTACCCAAT GGAGGT C CT GAAGACCCGGAT GGC GCT GC GGAAGACAGGCCAGTACTCAGGA 
AT GCT GGACTGC GCCAGGAGGAT CCT GGCCAGAGAGGGGGT GGCCGC CTT CTACAAAGGCTATGT CCCCAACATGCTG 
GGCATCATCCCCTATGCCGGCATCGACCTTGC^GTCTACGAGACGCTCAAGAATGCCTGGCTGCAGC^CTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGC^GGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGC^TCPlGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGG TGA CGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 

aagctgtctcgagccaagctgtgaaaaccctagacgcacccgcagggagggtggggagagctggcaggcccagggctt 
gtcctgctgaccccagcagaccctcctgttggttccagcgaagaccacaggcattccttagggtccagggtcagcagg 
ctccgggctcacatgtgtaaggacaggacattttctgcagtgcctgccaatagtgagcttggagcctggaggccggct 
tagttcttccatttcacccttgcagccagctgttggccacggcccctgccctctggtctgccgtgcatctccctgtgc 
cctcttgctgcctgcctgtctgctgaggtaaggtgggaggagggctacagcccacatcccaccccctcgtccaatccc 
ataatccatgatgaaaggtgaggtcacgtggcctccc^ggcctgacttcccaacctac^vgcattgacgccaacttggc 
tgtgaaggaagaggaaaggatctggccttgtggtcactggcatctgagccctgctgatggctggggctctcgggcatg 
cttgggagtgcagggggctcgggctgcctggcctggctgcacagaaggcaagtgctggggctcatggtgctctgagct 

GGCCT GGACCCT GTCAGGAT GGGCCCCACCTCAGAAC CAAACT CACT GTC C CCACTGT GGCAT GAGGGCAGT GGAGCA 

CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTT CTGGGGAGGGAAGGAAAAGGT GTT GGAGGCCTTAATTATGG 

ACTGT TGGGAAAAGGGTTTT GT C CAGAAGGACAAGCC G GACAAATGAGCGACTT CTGT GCTT CCAGAGGAAGACGAGG 

GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

GGGACCAGCCC CACATTCCACTT GTGTCACTGCTT GGAAC CTATTTAT TTT GTAT TT ATTT GAACAGAGT TATGT CCT 

AACTATTTT TATAGATTTGT TT AATTAAT AGCTT GT CATTTT CAAGT T CAT TTTT TATTCAT AT TTAT GT T CATGGTT 

GAT TGTAC CTTCCCAAGCCCGCC CAGTGGGATGGGAGGAGGAGGAGAAGGGGGGC CTT GGGC CGCTGCAGTCACAT CT 

GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 

GT TGGGGAAGGGCTT GCCCCCAGCCTTAGGAT T T CAGGGTT T GACT GGGGGCGT GGAGAGAGAGGGAGGAACCTCAAT 

AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 

TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 

CTTCTGCTGCCCTTGCTTAACAATGCCGGCCMCTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 

TGAGGAAATCTTCAATAGGATGCAAAGATCAATGC^WW^TTGTTATATATGAACATATAACTGG^ 

CAAAT TAAGAAAGAATT GGACGTTAGAAGTTGT CAT TTAAAGCAGCCT TCTAAT AAAGTT GT TTCAAAGCTGAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE s8 



MLCLCLYVPVTGEAQTEFQYFESKGLPAELKS I FKLS VFIPSQE FSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRD YHLLHP VEN IPEII LYWKHST I FDVGENLT VPDEFT VEERQTGMWWRHLVAGGG 
AGAVS RTC TAPLDRLKVLMQVHASRSNNMG I VGGFT QM I REGGARS LWRGNG I NVLK I APE SAI K 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKGYVPNML G 1 1 PYAG I DLAVYE TLKNAWLQH YAWS ADPGVFVLL ACGTMSS T C 
GQLASYPLALVRTRMQAQAS IE GAPEVTMS SL FKH I LRTEGAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 




Signal peptide: 



amino acids 1-16 



Putative transmembrane domains: 



amino acids 284-304, 339-360, 376-394 




Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 



N-glycosylation sites . 



amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 5 Q 



GGAAGGCAGCGGCAGCTCCAC TCAGCCAGTACCCAGATACGCTGGGAACCT TCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAAC AT T G G G GAG GAT G GAAT CC T G AGC T GCAC T TT T GAAC C T GACAT CAAAC T T T CT GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGC TGTCGGAGCAGGATGAAAT GT TCAGAGGCCGGACAGCAGTGTT T GCTGATCAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
\„ CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
% AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

teal 

Q] CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
ill ' CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

W { TCAACAACACATAC TCC T G TAT GAT TGAAAAT GACAT T GC CAAAGCAACAGGGGATATCAAAGT G 

ACAGAATCGGAGAT CAAAAGGCGGAGTCACC TACAGC T GC TAAAC TCAAAGGCT T CTC TGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 



' ^ £ »» 



.: t 



GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATAT GACC TAG T T TTAT AT T TC TGGGAGGAAATGAAT T CAT AT CT AGAAGTC TGGAGT G 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAA.T 
CTATCT T CAAAGACATAT TAGAAGT TGGGAAAATAATT CATGTGAACTAGACAAGTGTGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATT TTAGCATAAACAGAGCAGT 
CGGGGACACCGATT TT ATAAATAAAC T GAGCACC T TCTT T TTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWS 1 1 S 1 1 1 ILAGAIAL 1 1 G FGI SGRHS I TVTTVASAGNIGEDGILSCTFEPDIKLS 

DIVIQWLKEGVLGLVHEFKEGKDELSEQDSMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 

YI ITSKGKGNAJSTLEYKTGAFSMPEVNVDYNAS SETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 

NTSFELNSENVTMKWSVLYOTTIISOT 

CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 



amino acids 1-28 



Transmembrane domain: 

amino acids 258-281 



N-glycosylation sites. 

Q amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

220-224 



N-myri s toylation si tes . 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

T GACGTCAGAAT CACC ATGG CCAGCTATCC TTACCGGCAGGGC TGCC CAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGACTCAGATCACAGTGGCTATATCT CCAT GAAGGAGC TAAAGCAGGCCC TGGT CAAC T GCA 
ATT GGTCT TCAT TCAAT GATGAGACC T GCCT CATGATGAT AAACATGT TTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
C CAGCAGTATGACCGGGACCGC T CGGGCTCCAT TAGC TACACAGAGCTGCAGCAAGC TCT GT CC C 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAAT CC TGCCAT GCAGCT TGACCGCT TCAT CCAGGT G T GCACCCAGCTGCAGGTGC TGACAGA 
GGCCTTCC GGGAGAAGGACACAGC T GTACAAGGCAACATCCGGCTCAGCT T CGAGGAC T T CGTCA 
C CATGACAGC T T CT CGGATGCT ATGAC CCAACCATC TGTGGAGAGTGGAGT GCACCAGGGACCT T 
T CC TGGCT TC T TAGAGTGAGAGAAGTATGTGGACAT CTCTTCTTTTCCTGT CCC TCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C TGACCAAGCATGAGAGAGATCTGT C TAT GGGACCAGT GGC T TGGAT TCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAGT CCAGTGAAAT T GTAAGCTTCAATAAAAGGAT GAAACTC TGA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNYDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLM^^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DT AVQGN I RL S FE D FVTMTAS RML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
;, ;3! ; TGCCTGGGGAGTCTTCTCGCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
f\ TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
01 CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATGACCCCTGGGGTACCCTGAATCCCAGTG 
Ul TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

. ** 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

! f 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 



'Si 



Lf,£ 



TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAA21AAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 6 J. 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPMDSWPPSDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



3 E 



Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 
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FIGURE 6 s 



AAGGAGAGGCCACC GGGAC T TCAGT G TCTCCTCCAT CCCAGGAGCGCAGTGGCCAC TATGGGGT C 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACA7^ATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGT TGT C T GCAACACATGACAGCCAT TGAAGCCT GT GTCCT TC T TGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCGCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 



5 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site, 

amino acids 37-41 

N-myristoylation sites. 

amino "acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACC GAGGGT T CGAGGGAGGGACACGGACCAGGAAC CT GAGCTAGGTCAAAGACGCC CGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTC TC TGGGCAGT CAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACT TCAA 

AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGL P FLLARWGRAWGQI QTT SANENSTVLPS S T S S S S DGNLRPEAI TAI I WFS 
LLAALLLAVGLALLVRKLREKRQTEGT^ 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-rnyristoylation sites . 
amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTT GTTTTCCTCTTAGTT CTGT GCCT GCT GCACCAG 
TCAAATACTT CCTTCATTAAGCT GAAT AATAATGG CTT TGAAGAT ATT GT CAT TGTT ATAGATCCTAGT GT GCCAGAA 
GATGAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCT T CTAC GTACCT GTTT GAAGCCACAGAAAAAAGA 
TTTTT TTTCAAAAAT GTATCTATAT TAATT CCTGAGAATT GGAAGGAAAATCCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCT GAT GT TAT AGTT G CACCACCTACACTCCCAGGTAGAGATGAACCATACACCAAGCAGT T CACA 
GAATGT GGAGAGAAAGGC GAATACATT CACT TCACC CCT GACCTT CTACT TGGAAAAAAACAAAATGAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGT CAAAAAAAAT CGAAGCAACAAGGTGTTCCGCAGGTAT CTCTGGT AGAAATAGAGT TTATAAGT GT 
CAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACATWSlCTGTATGGAAAAGATTGTCAATTCTTT 
CCTGATAAAGTACAAACAGAAAAAG CAT CCATAATGTTTATGCAAAGTATTGATTCT GTTGTTGAATTTTGTAACGAA 
AAAACCCATAATCAAGAAGCT CCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACAT GGGAGGT GATTAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTC^TTGCTGAAGATC^GTC^ 
AGAATT GTGTGCTTAGTTCTT GATAAGTCTGGAAGCAT GGG GGGT AAGGACCGCCTAAAT C GAATGAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCTAATC CAAAT AAAAAGCAGT GAT GAAAGAAACACACTCATGGCAGGATTACCTACAT ATCCTCT GGGAGGA 
ACTT CCAT CT GCT CT GGAATT AAATAT GCATTTCAGGTGATT GGAGAGCTACATTC CCAACTCGATGGATCC GAAGTA 
CT GCT GCT GACT GAT GGGGAGGATAACACT GCAAGT T CTTGTATT GATGAAGT GAAACAAAGT GGGGCCATTGTT CAT 
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
TC AGAT GAAGCT CAGAACAAT G GCCT CATT GATGCTTT T GGGGCT CT TACATCAGGAAATACTGATCT CTCCCAGAAG 
TCCCT T CAGCT C GAAAGTAAGGGATTAACACT GAATAGT AAT GCCT GGAT GAACGAC ACT GTCATAAT TGATAGT ACA 
GT GGGAAAGGACACGTT CTT T CT CAT CACAT GGAACAGTCTGCCT CCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
AT AAT GGAAAATTTCACAGT GGATGCAACTT CCAAAAT GGCCTAT CT CAGTATTCCAGGAACTGCAAAGGT GGGCACT 
TGGGCATACAAT CTTCAAGC CAAAGCGAAC C CAGAAACATTAACT AT TACAGT AACTTCT CGAGCAGCAAATT CTT CT 
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATG^ 
ATTCTACAAGGAT ATGTACCT GTTCTT GGAGC CAATGT GACT GCTTT CATT GAATCACAGAATGGACATACAGAAGTT 
TT GGAACTTTTGGATAATGGT GCAGGC GCT GATT CT TT CAAGAAT GATGGAGT CTACTCCAGGTATTTTACAGCATAT 
AC AGAAAAT GGCAGATATAGCTTAAAAGTT C GGGCT CAT GGAGGAGCAAACACTGCCAGG CTAAAAT TACGGCCT CCA 
CT GAATAGAGCC GCGTACATACCAGGCT GG GT AGT GAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATT GAT 
GAGGAT ACTCAGACCACCTT GGAGGATT TCAGCCGAACAGCATCC GGAGGT GCAT TT G TGGTATCACAAGT CCCAAGC 
CTTCC CTTGCCTGACCAATAC CCACCAAGT CAAAT CACAGAC CTT GAT GCCACAGTT CAT GAGGATAAGAT T ATT CTT 
ACATGGACAGCAC CAGGAGAT AATTTT GATGTTGGAAAAGTT CAAC GTTATAT CAT AAGAATAAGT GCAAGT ATT CTT 
GATCTAAGAGACAGTTT TGAT GATGCTCTTCAAGTAAATACTACT GAT CT GTCAC CAAAGGAGGCCAACTCCAAGGAA 
AGCTT T GCATTTAAAC CAGAAAAT ATCT CAGAAGAAAAT GCAAC C CACAT ATTTAT T GCCAT TAAAAGTAT AGATAAA 
AGCAATTT GACAT CAAAAGTATCCAACATTGCACAAGT AACTT TGTTTAT CC CTCAAGCAAATCCT GATGACATT GAT 
CCTACACCTACTC CTACTCCT ACTCCTACTC CTGATAAAAGT CATAATTCTGGAGT T AAT AT TT CTACGCT GGTAT TG 
T CTGT GATT GGGT CTGT TGTAATTGTTAACT T TATT TT AAGT ACCACCATT TGAA C CT TAACGAAGAAAAAAATCT TC 
AAGTAGACCT AGAAGAGAGTT TT AAAAAACAAAACAAT GTAAGTAAAGGATAT TTCT GAAT C TTAAAATTCATCC CAT 
GT GTGATCAT AAACTCATAAAAATAATTTTAAGAT GTC GGAAAAGGATACTTT GATT AAAT AAAAACACT CAT GGATA 
TGTAAAAACT GTCAAGATTAAAATT TAATAGTTTCAT T T ATT T GT TATTT TAT TTGT AAGAAATAGT GAT GAACAAAG 
AT CCTT TTT CATACTGATAC CTGGT TGTATATTATTT GATGCAACAGTTT T CT GAAATGATATTTCAAATT GCAT CAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



r * 

■r, i 



■ ^: 

U 



yj 

■i s I 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNErDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQS I DS VVEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGTIMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANTPETLTITVTSR 
AANSSVPPITWAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGVYS R Y FTAY TENGRYS LKVRAHGGANTARLKLR P PLNRAAY I PG WWNGE I EANP P 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRISAS ILDLRDS FDDALQVNTTDLSPKEANSKES FAFKPENI SEENATHI FIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAAC C CTGGGAGTAGAGTACTGACAGCAAAGA.C CGGGAAA.GACCATACGT CCCC GGGCAGG GGTGA 
CAACAGGT GT CAT CTTTTTGATCTCGT GTGT GGCT GCCTTCCTATTTCAAGGAAAGACGC CAAGGTAATTT T GAC CCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCT T C CCT TCTTGAACC C CCAGTTATGC CAGGAT TTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT G CT CAGAAGT GACTT GCCT GAGGGT GGAC CAGAAGAAAGGAAAGGTCC CCT CTT G CTGTT GGCTGCACAT CAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGT CATT GCTTCT GCCT GCAAGATCATCCTTTAAA 
AGTAGAGAAGCT GCTCT GTGT GGTGGTTAACT CCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGAT GCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGA ATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTT GCTGGTGCTCCTCTGCTGTGCTATCT CTGT CCT GTACATGTTG GCCTGCACC CCAAAAGGT GACGAGGAGCAG 
CT GGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTAC CAGGCCGT CCTTCAGGAGTGG GAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
■r~ AAT GG GCAGT AC CAAGCCAGCGATGCT GCTGGCCT GGGT CT GGACAGGAGCCCCCCAGAGAAAACC CAGG C CGACCTC 

1% CT GGC CTTCCTGCACT CGCAGGT GGACAAGGCAGAGGT GAAT GCT GGCGT CAAGCT GGCCACAGAGT ATGCAGCAGTG 

'Zl CCT TT CGATAGCT TTACTCT ACAGAAG GTGTACCAGCT GGAGACT GGCCTTAC CCG C CACC CCGAGGAGAAGCCT GTG 

AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
ft] CC CAAT CACC GT C CTTACACGGC CTCTGATT T CATAGAAGGGATCT ACC GAACAGAAAGGGACAAAGGGACATT GTAT 

T A GAGCT CACCTTCAAAGGGGACCACAAACACGAATT CAAACGGCTCATCTTATTTCGACCATT CAGC CCCAT CAT GAAA 

* GTGAAAAATGAAAAGCT CAACAT GGCCAACACGCTTAT CAAT GTTATCGT GCCTCTAGCAAAAAGGGTGGACAAGTTC 

y! CGGCAGTT CATGCAGAATTT CAGGGAGATGT GCATTGAGCAGGAT GGGAGAGT CCAT CTCACTGTT GTTTACTTT GGG 

iT t AAAGAAGAAATAAAT GAAGT CAAAGGAATACT TGAAAACACT TCCAAAGCTGCCAACT TCAGGAACT T TACCTT CATC 

CAGCT GAATGGAGAAT TTTCT CGGGGAAAGGGACTTGAT GTT GGAGCCCGCTT CTGGAAGGGAAGCAACGT CCTT CTC 
?- 1 1 TTTTT CTGTGAT GTGGACAT CTACTTCACAT CTGAATT CCTCAATACGTGTAGGCT GAAT ACACAG CCAGGGAAGAAG 

GTATT TTATCCAGTT CTTTT CAGTCAGTACAATCCTGGCATAATATACGGCCACCAT GAT GCAGTCCCTCC CTT GGAA 
^ CAGCAGCT GGTCATAAAGAAGGAAACT GGAT T TTGGAGAGACTTT GGATT TGGGAT GACGT GTCAGTATC GGTCAGAC 

L'| TT CAT CAATATAGGT GGGTT T GATCTG GACAT CAAAGGCTGGGGC GGAGAGGATGT GCAC CT TT AT CGCAAGTAT CTC 

in CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCT CTGGCAT GAGAAGCGCT GCATGGACGAG 

^ S ' CT GAC CCC CGAGCAGTACAAGAT GTGCATGCAGTC CAAGGCCATGAACGAGGCATCC CACGGC CAGCT GGGCATGCTG 

Q GTGTTCAGGCACGAGATAGAGGCTC^CCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGaACTCCCAGA 
; \ GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
*M GGACGACAAAAGAATT GGACT GATGGGT CAGAGAT GAGAAAGCCT CCGATTTCTCT CTGTT GGGCTT T TTAC AACAGA 

Q AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
jsjy AGATTATAAGCC T AAT GGTGT GGAGGTTTTGATGGT GTTTACAAT ACACT GAGACCT GTT GT TTTGT GTGCT CAT TGA 

i AATATT CATGATT TAAGAGCAGTTTTGTAAAAAATT CATTAGCAT GAAAGGCAAGCATATTTCTCCT CATAT GAAT GA 

GC CTAT CAGCAGGGCT CTAGTTTCTAGGAAT GCT AAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTAT GATACT 
AGTGAGTAC^TTAAGTAAAATAAAATGGACCAG^AAAGAAAAGAAACCATAAATAT CG TGT CAT ATT TTC CC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTT TTTCCCTTGT GAGTTATAGT CT GCTTAT TTAAT TACCACT TT GCAAGCCTTAC AAGAGAGCACAAGTT GGCCTAC 
ATT TTTATATTTT TTAAGAAGAT ACTTT GAGATGCATT ATGAGAACT TT CAGTTCAAAGCATCAAATT GAT GCCATAT 
CCAAGGACAT GC CAAAT GCT GATTCT GT CAGGCACTGAAT GT CAGGCATT GAGACAT AGGGAAGGAAT GGTTTGTACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACTTT CTGCT TTACAGAAAAGGAAACT CATT CAGACTGGTGATAT CGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT GCT CCT C CTGGCTTCTGGCTT C CATAAGAAGAAAT GGAGAAAAATAT ATAT ATATAT ATATATATT GT 
GAAAGATCAATC CATCT GCCAGAATCTAGTGGGAT GGAAGTT TTT GCTACATGTTAT CCACCC CAGGCCAGGT GGAAG 
TAACT GAATTATT TTT TAAAT TAAGCAGTTCT ACT CAAT CACCAAGATGCT TCTGAAAAT T GCATTTTATT ACCAT TT 
CAAACTATTT T TTAAAAATAAATACAGTTAACATAGAGT GGT TTCTT CATT CATGT GAAAATTATTAGCCAGCAC CAG 
AT GCATGAGCTAATTAT CTCT TT GAGT CCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTT CAAGAAC 
ATT CAAGCT GTTGGTGT GTT AAAAAAT GCATT GTATT GAT TT GTACTGGTAGTTTAT GAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGTGGTATT GC ACAGCTAATAAAATAT GATTT GT GGAT ATGAA 
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FIGURE 72 

MMMVRRGLLAWISRVWLLVLLCC^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AE VNAGVKLAT E YAAVP FDS FT LQKVYQLE TGL TRH PEEKP VRKDKRDE L VEAI E SALE T LNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHE FKRLILFRPFSPIMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEE INEVKGI LENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

'■ f 

% KYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
% RKQKQKT S SKKT 

tet$ 

■> » -w 

y '■ 

yj Important features: 

Signal peptide: 

'd amino acids 1-27 

i. ; • 

Q N-glycosylation sites. 

J] amino acids 315-319, 324-328 

• S/M ft 

I 'i '* 

N-myristoylation sites. 

Ill amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAAC CCAT GATACCC TAC TGAACACCGAATCCCC T GGAAGCC CACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GC CCAGTCGCC CATCGATAT TCAGACAGACAGT GTGACAT T TGACCCTGATT T GCC 
L-« T GC T C T GCAG CC CC AC GGAT AT GAC C AGC C TGG CACC GAGC C T T TGGACC T G CAC AAC AAT GGC C 

» T 

H ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

GCC CAGCT C CACCTGCAC T GGGGT CAGAAAGGATCCC CAGGGGGGTCAGAACACCAGAT CAACAG 
TGAAGCC AC AT T TGCAGAGCT C CACAT TGTACATTAT GAC T CT GAT TCCTATGACAGCT T GAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 



GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 



«f } AT T TCAATGGAACAGCT GGAAAAGCT TCAGGGGACAT TGT TCT CCACAGAAGAGGAGCCCT C TAA 



jf > 



GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
^ TCCAAGCAGGAT CC T CGTATACCACAGGTGAAATGCT GAGTCTAGG TGTAGGAATCT T GGT T GGC 

fU TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGTGTGGTCT TCACCTCAGCACAAGCCACGACTGAGGC ATAAA TTCCTTCT CAGATAC 

CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCT G TCAT TCCAGGAAGAAC T GCAGAGCCTT CAGC CTCT CCAAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 

ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCC T TAGGATAAAGAGT T GCT GT TGAAG T T GTATAT T T T T GATCAATATAT T T GGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 74. 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGIAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QOTRALQPLNQIWFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAWFIARKIRKKRLENRKS 
WFT SAQAT TEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

ATGGACTCCACCAGAGGGTCTT CAAGGAGTTAAAGT TACT TACACTGTGCAGTATTTCAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACC TTCC TGT TTCGAT GCAACAAATATAC T CCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
^ AGGGCCCCCTCGGCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
% CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
m TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
Uj GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
m ACT TT AT CACCCT CAATAT CTCGGAT GAT TC TAAAAT TT CTCAT CAGGATATGAG T T T AC TGGGA 

** AAAAGCAG T GATGTATCCAGCCT TAATGAT CC T CAGC CCAGCGGGAACCTGAGGCCCCCT CAGGA 

GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACAC GGAAGGTAC T TCTCT CACCCAGCAAGAGTC CC TCAGCAGAACAATAC CCCCGGATAAA 
Ul ACAGTCAT TGAATATGAATATGATGT CAGAACCAC TGACATTTGTGCGGGGCCTGAAGAGCAGGA 

O GCT CAGT T TGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 

% TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
SJi CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

f 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACC TAT CTCATGCAAT TCATGGAGGAAT GGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFYPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLKDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGGCTGGTGAACACCGTCCTGAAGCACATG 
ATCTGGCTGAAGGTCATCACAGCTAAGATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGGTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCAC CAT C CGCAT GGACACCAGT GC AAG T GGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C CAAT C TAGT GAAAAACCAGCT G T GT CCCGTGATCGAGGCT TC CT T CAAT GGCAT GTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCGATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GAT CAAT GAAAAGGC T G CAGATAAG CT G GGAT C TAC CC AGAT C GT GAAGAT C C TAAC T CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACACCAAAGGTGACCAAC T TATACT CAACT TGAATAACAT CAGC TCT GATCGGAT CCAGC TG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



a 



hi 



• Mi* 

X 1* > 
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MAGPWT FTLLCGLLAATL IQATLS PTAVL I LGPKVI KEKLTQE LKDHNAT S I LQQLPL LSAMREK 
PAGGIPVLGSLVNTVLKHI IWLKVITANI LQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWAIAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KW FKFNSAAS LTMPTLDN I PFS L I VS QDWKAAVAAVLS PEE FMVLLDS VL PE SAHRLKS S I GL IN 
EKAADKLG STQ I VK I L T QDT PE FFI DQGEAKVAQL I VLE VFP S SEALRPL FTLG IE AS SE AQFYT 
KGDQL I LNLNN I S S DRI QLMNS G I G WFQPDVLKN IITEIIHSI LLPNQNGKLRS GVPVS LVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 



AppJD-10063579 



Page 230 of 320 



FIGURE 7Q 



GAGAGAAG TCAGCC TGGCAGAGAGAC TCTGAAATGAGGGAT TAGAGGTGT TCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCT'GGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
C CAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCAT CAC CCAG TGTGACAT CTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGT GGAGTC T T TT T CAT CCT TGGAGGCCTCCT GGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTAGTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGT CTGT GAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACT GGATCGT 
GTCAGAAGGTGC TGC TGAGGATAGAC TGAC T T T GGCCATT GGAT TGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T T GTGCATGAACTGAAATAAAACCATCC TACGGT AT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGY I LGLLGLLGTLVAMLLPSWKTS S YVGAS I VTAVGFS KGLWMECATHS TG I TQCD 
IYS TLLGLPAD IQAAQAMMVTS SAI SSLAC 1 1 S WGMRCTVFCQESRAKDRVAVAGGVFFI LGGL 
LG F I PVAWNLHG I LRDFYS PLVPDSMKFE IGEAL YLG 1 1 S SLFS L I AG 1 I LC FSCS SQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 



Transmembrane domains: 



amino acids 82-102, 117-140, 163-182 




N-glycosylation si te . 



amino acids 190-193 



PMP-22 / EMP / MP20 family proteins. 



amino acids 46-59 
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FIGURE 81 
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CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATGG TTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTGTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATC TAC TGCGGCC T CACAACC TGCCCC GAAC CAGGC TGCCCAGC ACCC CTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCT GCAAAGATGAGGCAAGT GAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C TCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGAT GAGGAAACTGAGGCTCAGAGAG 
G TGAAGTACCTGGCCCAAGGCCACACAGC CAGAATCT TCCACT TGACTCAGATCAAGAAAG T CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAAC GT CTT CCTAGCCCAGACCCTGGAGCTGAAGGT CACGGC CAGT CCAGACAAAGT GACCAAG 
ACATAACAAAGACC TAAC AGT TGCAGATATGAGCTGTATAATT GTTGT TAT TAT AT AT TAAT AAA 
TAAGAAGT TGCAT TACCC T CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHT S VSPSPDNLRRFALEHEAS DLVE I YLWKLVKDEE TEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 



y Signal peptide: 

Zl amino acids 1-25 



•v. i 



Pi 



! 2 f. 
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FIGURE 8 s 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TC CGT T TCCATGCCGT GAGGTCCATTCACAGAAC AC AT CCATGGCT CTC ATGCT CAGTTTGGTT CTGAGTC 
TCCTC AAGCTGGGAT C AGGGCAGT GGCAGG TGT T TGGGCC AGACAAGCCTGT CC AG GCCTTGGT GGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTTCT CTAGCGT GGTCCACCT CT ACAGGGACGGGAAGG ACC AGCC ATTT AT GC AG AT GCCAC AGT ATC 
AAGGC AGG ACAAAACTGGTGAAGGATTCT ATT GCGGAGGGG CGC AT CTCTCT GAGGCT GG AAAAC ATT ACT 
GT GT TGGAT GCT GGCCTCTATGGGTGC AGG AT TAGTTCCC AGTCTT ACTACC AGAAGGCC AT CT GGGAGCT 



f*\ ACAGGTGTCAGCACTGGGCT CAGT TCCTCTCATTTCCATCACGGGATATGTT GAT AGAGACATCCAGCTAC 

f s % TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGT GGAAAGGTCCAC AAGG AC AGGAT T TGTCC 



ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGT GGAGATCT CTCTGACCGT CCAAGAGAA 
Uf CGCCGGGAGCAT AT CCTGTT CCAT GCGGCATGCT C ATCTG AGCC GAGAGGTGGAATCC AGGGT AC AGATAG 

U] GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 



TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACC CGAAGCTCTGCGTTTCTG AT CTGAAAACT GTAACCC AT AGAAAAGC TCCC CAGG AG GTG 
CC T CACTCTGAGAAGAGATT TACAAGGAAG AGT GTGGTGGCT TC TC AG AGTT TCC AAGC AGGGAAAC ATT A 
CT GGGAGGTGGACGGAGG AC AC AAT AAAAGGT GGCGCGTGGGAGTGTGCCGGGATGATGT GGACAGGAGGA 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATTGAGGCCCT AC ATTGAGTATCCGT CCT AT AAT GAGC AAAATGGAACTCCCAT AGT CATC TGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTC CTCCTC ACAGGC AACC AC GCCCTTCC T CCCCAGGGGTGAAAT GTAGGATG AATC AC AT CCC ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GAT GAAGGGGGACT GGCCTGTCCACATGGG AGT C AGGTGTCATGGCTG CCCT GAGCTGGG AGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACC GT CAGGAATT CCC ATCT C AC AGGCTGTGGTGT AG AT T AAGT AGACAAGGAATGT GAAT AATGCT T AG 
AT CTT ATTGATGACAGAGTGTATC CT AATGGT T T GT TC ATT AT ATT AC ACTT TCAGT AAAAAAA 
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MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAKMEVRFFRGQFSSVVH 
L YRDGKDQP EMQMPQYQGRTKLVKDS I AEGR I S LRLENI T VLDAGL YGCRI S S QS YYQKAI WELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE YVTLS PDHGYWVLRLNGEHL YFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI C PVTQES EKEAS WQRAS AI PETSNSES S S QAT T P FLPRGEM 

Signal peptide: 
amino acids 1-17 

Transmembrane domain : 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTT CCCTCGCGGCCCTGGCACC TC TAAC CCCAGACATGC TGCT GC TGCT GCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TAC T TCT TT CGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACAT CACCGGCT CT CT GTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGT GT CCTACCCGCCTCAGAACTTGACCATGACTGTCT T CCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAG TGAC T CAGGGGGTGG T GGGGGGAGCT GGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCT CAGGGGCCCCTGACT GAACCT TGGGCAGAAGACAGT CCCCCAGACCAGCC T CCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACT GCAGAGAC TCACCCTGAT TGAGGGATCACAGCC CCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCT TGTAGAATTAAC AGCCC TCAACG TGAT GAGC TATGATAACACTAT GAAT TAT G 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTT T TAT T T T T T TAACT AAAAGACAGACAAAT TCCTA 
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MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTK^^ 

KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKAT S GVTQGWGGAGATALVFLSFCVI FVWRSCRKKSARPAAGVGDTG I E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAG T GCAGC TGAGACTCAGACAAGAT TACAATGAACCAAC TCAGCT T CCT GCT GT TTC 
TCATAGCGACGACCAGAGGATGGAG TACAGAT GAGGCTAATAC TTACTT CAAGGAAT GGACCT GT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACC T T TGGAT C TGCAGAGGCGGCCACGAGCGAT GAC TACAAGAACCCT GGCTAC T 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCT GAGG TAC CGCACGGACAC TGGC TTCCTCCAGACAC TGGGACAT AATCT 
G T T T GGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAG TGT TGGAC TGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGC GGGAAT TCAC TGCGGGAT T T GT TCAGTTCAGGGT AT TTAATAACGAGAGAGCAGCCAA 
CGCCT TGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAAC T TACCCAGTAGC TAGAAT GT TAATGGCAGAAGAGAAAACAATAAATCATATT GACT CAAGA 
AAAAAA 
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MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTSNGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 7.4-78, 289-293 

N-myristoylation sites . 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCAC T GAAAT GGC T T TAT TCG T CACC GT C TT TGGGCT GAAAAAGAAACCC T TCTGAT T AC CT T CA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTTTTAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acicis 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGC GAGGGGAC GAGAGC GGAGGAGGAAGATGCAAC TGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTT T GGCT GGGGCGACT T CTACTCCAACAT CAAGACGGT G 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
^ CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 

>• 

%t AGT TC CACCAGGAACAGCAGAT CT TCAT C GAAGCCAAGGCCTC CAAAATC T TCAACT GCC GGAT G 

m GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACC TCGC T T T GCACCCACGACCCAGCCAAGAT C T G 

.y ' 

Uj CTCCCGAGACCACGCTCAGAGCTCAGGCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
!j CAT AGT GAT ACCCC CTAC TACC CAT CTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

hi 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
Q AGGCAGGGAGGGGGGTGGAGAC GAGGAGATGCCAAGT GGGGCCAGGGCCAAGT C TCAAGT GGCAG 

L11 AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

AS.fr 

\*\ AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

lit 

% TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
fll CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCC TTAAGC TAAGACAGGAC GAT TGTGGTCC TCCCACACTAAGGC CACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGS5DPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 




N-glycosylation sites. 



amino acids 62-65, 127-130, 137-140, 143-146 



2-oxo acid dehydrogenases acyl transferase 



amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
AT GT TCC GAT T TGCATAT TATAAACTCT TAAAAAAAGCCAGT GAAGGT T T GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
%l GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
}*l ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 

hit 

01 TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
yj TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 

: SJ? 

y \ AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 

: f\ AACC T CAGGGAACCAGCACT TCCCAAACCGCAGAC TACATCT T TAGAGGAAGCACAAC TGTGCC T 

TT TTCTGAAAATCCCTT TT T CT GGTGGAAT TGAGAAAGAAATAAAACTATGCAGAT A 

*^ 

^■«?> 
; 53 

\ £ S 

: 

~_*> *r '+ 

r f 
■ * r 
t » 

hi 

ff 

* 1 

m 
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FIGURE Q J. 



MTAAVFFGCAF I AFGPALALYVFT I AI EPLRI I FL I AGAFFWLVSLL I S SLVW FMARVI I DNKDG 
PTQKYLL I FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTOAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide : 

amino acids 1-19 




Transmembrane domains: 




amino acids 32-51, 119-138, 152-169, 216-235 



Glycosaminoglycan attachment site. 



amino acids 120-123 



Sodium: neurotransmitter symporter family protein 



amino acids 31-65 
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FIGURE 



AAT T TT T CACCAGAGTAAAC T T GAGAAACCAAC TGGACCT T GAG TAT TGTACAT TTTGCCTCGTG 
GACCCAAAGG TAGCAATCTGAAACATGAGGAGTACGATT C T AC TGT T T TG TC T T CT AGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TACCAAACCAACAGCAGT CAAATCAGG TC T TTC CTTC TT TAAGT CTGATACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCAT TGACCCT GGGAGGGT TGAATGTACAACAGCAAC TGCACCCACAT GT GT TAC 
CAAT TT TTGT CACACAACT T GGAGCCCAGGGCACTATCC TAAGCT CAGAGGAAT TGCC AC AAA.TC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCC TCCCAAC T CCCAG T GGCACAGAT GACGACTT T GCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACAC AT GCCAT CGAGGAAGCCACCACAGAAT CAGCAAATGGAAT TCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATT GAT TATAT TATGGAATAGATT GAGACACATTGGATAGT C T TAGAAGAAAT T AAT T CT TAATT 
TAC CTGAAAATAT TCTT GAAAT T T CAGAAAATATGTTCT AT G T AGAGAATCCCAACT T TT AAAAA 
CAAT AAT TCAATGGATAAAT C T GT C T T T GAAATATAACAT T AT GCTGCCTGGAT GATATGCATAT 
TAAAACTTATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
L PQ I FT S L 1 1 HS L FPGG I L PT S QAGANPDVQDG SL PAGGAGVNPATQGT PAGRLPT PS G 
T DDD FAVT T PAG I QRS T HAI SEAT TE S ANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGG 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCC T TGACCACGCCC TGGAGAGGAC C TACCAGC TGT T GGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGGACCTGGGTGT 
CCCTAGAGCCTATCCACC TGGCAGAGAAT C TCAAAG T CC TATACCCGCACCACATGGCCC AGGTA 
CACTGGAGTGGGGGTGATGTGCAGTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAG T CCAC CAGG T AC T GAAGT GAC TAG AC TGT CAG CAG AGGAT GCAGAT GCC CC CGG C TC C CCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGT CGCAGTCACAGATATCAAT GATCACGCC CCTGAGTT CAT CACT TCCCAGAT T GGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCT GGATT GGGAGCCAGACTCTGGGCATGTTAGAC TCAGACT C TGCAAGAACC TCAGT T AT GAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T TCACCCAC TGGACCATG T CAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAA?VAAAAAAZW^G 
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FIGURE 08 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S TS LDHALERT YQLLVQVKDMGDQASGHQATATVE VS 1 1 EST WVS LE P I HLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
yi . REHI IPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGI FLI 

LIFTHWTMSRKKDPDQPADSVPLKATV 



32 



; : s 

/■■■■» .'! 



5 



Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CC AAGAT TC T TCCCAGGAACACAAACGTAGGAGACC CACGCTC C TGGAAGCAC CAGCC TT TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATC TTGCTT TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
TTCTCCT TATGTT TGGTC TACTAT TGCATTTAGAAGCTGCAACAAAT TCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG T CACCAAC TC T GAGT T CCAT ACAACC TCCAGTGGGAT CAGCACAGCCACCAACT CTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCT GAGT CCAGCACACCCTCCAGT GGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AG T AGGGCCAGCAC T GCCACCAAC T C TGAGT CTAGCACACT CT CCAGTGGGGCCAGCACAGCCAC 
CAACT CT GAC TCCAGCACAACC TCCAGTGGGGC TAGCACAGCCACCAACTCTGAGT CCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAAC TCTGAC TC CAGCACAGTGTCC AGT GGGGC CAGCAC T GCCACCAACT CT GAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACT CT GAC T CCAGCACAACCT C CAG T GGGGCCGGCACAGCCAC CAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AG T GGGGCCAACACAGCCAC CAACT CTGAGT CCAG TACGACCT CCAGTGGGGCCAACACAGCCAC 
CAAC TC T GAG TCCAGCACAGTGTCCAGTGGGGCCAGC AC TGCCACCAACTCTGAG T CCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACT C TGAC TCCAGCACAACCT CCAGTGAGGCCAGCACAGCCAC CAACT CTGAGTCTAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G TGGAG TCCTAAC TGGT TCTGGAGGAGACCAGT AT CATCGATAGCCATGGAGAT GAGC GGGAGGA 
ACAGCGGGCCC TGAGCAGCCCCGGAAGCAAGT GCC GCAT T CT T CAGGAAGGAAGAGACCT GGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT CTTGAAGAAGGTAT TCCTCACC T TTCTTGCCTT TACCAGACAC T GGAAAGAGAATACTATAT 
TGC T CATT TAGCTAAGAAATAAATACAT C TCATCTAACACACACGACAAAGAGAAGC T GT GCT T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVS IVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGAS TATNSESSTPSS 
GASTVTNSGS SVTS SGASTATNSES STVS SRAS TATNSESSTLSSGASTATNSDS STTS SGAS TA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTS SGAS TATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 




amino acids 1-20 



Signal peptide: 



Transmembrane domain: 




amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 

CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGT G TCATGGAAAGGGAT TT AC TT TATAC TGAC TC TGT T T T GGGGAAGC T T T T T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
H* CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
5f GAT TATAACT GGGGATGCAT T T GT TCCT GGAGAAAGAAGTGT CAT TATCATGAACCAT CGGACAA 

GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTT CAT T CATAGGAAATGGAAGGAT GACAAGAGCCAT T T CGAAGACATGAT TGAT T ACTT TTGTG 
ATAT TCACGAACCACTTCAACT CCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
T CT CGAAG TAATGCATT TGCTGAAAAAAATGGACTTCAGAAAT ATGAATATG TT T TACAT CCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 

>; ■ 

f] ATATCAC TGTGGCG TAT CC T CACAACAT TCCTCAAT CAGAGAAGCACC T CCTCCAAGGAGACTT T 

CCCAGGGAAATCCACTT TCACGTCCACCGGTATCCAATAGACACCC TCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT TT T TAT TT T ACCGGACAGAGT G TCAT TCCACCT TGCAAGT C TGAACT CAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATAT T T GG T GGACTGGAGAT CATAGAACTT GCAT GT TACCGACT T T TACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAATGAG TAAGAT TATAAGGTT T GCCAT GT GAAAACCTAGAGCATAT T TT G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATT AT T TGT TAAAGATAT T TTGCACT TAAT T T T G T GGGAAAAATAT TGCT ACAAT T TT T 
TTTAATCTCTGAATGTAAT T TCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACT T 
GGGCC AGAATATTAT TAAACAATCATCAGGC T T T TAAA 



; c:: 



'.153 

B 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKS FGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPMFVNPSWYRWINNRLVATOLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWi\MQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 

YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAgCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC C T CT ACAGAGATGGGGAAGACTGGGAATCTAAGCAGAT GC CACAG TATCGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
y ; AGGAT T T GTC TTCAGAC TCCAGAGCAAATGCAGATGGGT AC AGCCT GTATGAT GT GGAGATCTCC 

'% h ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 

GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
O TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
ill AAATCCAAAGGGAAAAT CCAGGCGGAAC TGGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

|jj AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
fn TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
X f\ TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
"fi CGT GGGACAAAATGTAGGG TGGTATGTGGGAGTGT G T CGGGAT GACGTAGACAGGGGGAAGAACA 

J! ATGTGACT TTGTCTCCCAACAATGGGTAT TGGGTCC T CAGACT GACAACAGAACAT T T G T ATT TC 

% ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
jis GGACTAT GAGGGTGGGAC CATC TCCT TCT TCAATACAAATGACCAGTCCC T TAT T TAT ACCC TGC 

% TGACATGT CAGT T TGAAGGCT T GT TGAGACCCTATAT CCAGCATGCGATG TAT GACGAGGAAAAG 

U'f GGGACT C CCATAT TCATAT GTC CAGT GTC CTGGGGATGAGACAGAGAAGACCC T GCT T AAAGGGC 

CJ CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGAOVGGTGGCCCCAGCTTCCTCT 
j^j CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 

TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
% TCAGAAGCCAT GGC TGCCCTGAAGT GGGGACGGAATAGAC T CACAT TAGG T T TAGTT T GT GAAAA 

Hi CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGTGAT T CCTGCC TCACAGGTGAAGAT TAAAGAGACAACGAATG TGAATCAT GCT T GCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGC T TAT TCCACAT TAAT T TACT T T T CTC TATACCAAAT CACCCAT GGAATAGT TAT T GAACACC 
TGCTTTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCAT 
TACCT GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC TACAGGTC CATATCC 
C TC AT TAAC ACAGACACAAAAAT T CTAAATAAAAT TT TAACAAATTAAACTAAACAAT AT AT T TA 
. AAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAAT 
ATC AACCAGT GTAAT TCAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVL ILVLS FYELVSGQWQVTGPGKFVQALVGEDAVFS CSLFPE TSAEAME VRFFRNQ FHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKI QAEL DWRRKHGQAE LRDARKHAVE VT LD PE TAHPKLCVS DLKT VTHRKAPQEVPHS EKR F 
TRKS WASQG FQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS PNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 



App_K>=10063579 



Page 256 of 320 



CCT T C AC AGGACTCT TCAT T GCTGGTTGGC AATGATGT AT CGGCCAGATGTGGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGT T CAT T ATGTGAGAT AT AATC AAAAGAAGACCT AC AATT ACT AT AGCAC AT TGTCAT TT AC AAC 
TGACAAACT ATATGCTGAGT TT GGCAGAGAGGCT T C TAAC AATT TT AC AG AAAT GAGCCAGAGACTT G AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAAC AGAAGCATGGAGT GT TGGCTC AT AT GCTGTT GAT T TGT AGATTTCACTCTACTGAGGATCC TG A 
AACT GT AGAT AAAATTGT TC AACT T GTT TT AC AT GAAAAGCT GC AAGATGCT GT AGGACCCCCT AAAGT AG 
AT CCTCACTCAGTT AAAATT AAAAAAAT CAAC AAGACAGAAACAGAC AGCTATCT AAACC AT TGCTGC GG A 
AC ACGAAGAAGT AAAACTCT AGGT CAGAGTCT CAGG ATCGTT GGTGGG AC AGAAGT AGAAGAGGGTGAAT G 
GC CCTGGCAGGCTAGCCTGC AGTGGG AT GGGAGT C ATCGCTGT G GAGC AACC TT AATT AATGCC AC AT GGC 
TT GT GAGTGCTGCT CACTGT T TTACAAC AT AT AAGAACCCTGCCAGATGG AC TGCTTCCTT T GGAGT AAC A 
AT AAAACCTTCGAAAATGAAAC GGGGTCTCCGGAGAAT AATT GT CC ATGAAAAAT ACAAACACCC AT C AC A 
TGACT ATG AT AT TT CTCTTGCAGAGCTTTCT AGC C CTGTTCCCT AC AC AAAT GCAGTAC ATAGAGTT T GTC 
TCCCTGATGCAT CC TATGAGTTT C AACC AGGT GAT GTGAT GT T T GTGAC AGGATTT GGAGCACT GAAAAAT 
GATGGT T ACAGT CAAAATCATCT T CG AC AAGC AC AG GTGACTCT C ATAGACGCT AC AACTTGCAATGAACC 
TC AAGCTT ACAATGACGCCAT AAC TC CT AGAATGT T ATGT GCT GGCTCCT TAGAAGGAAAAACAGATGCAT 
GCCAGGGT GACT CTGGAGGACC AC TG GTTAGT TC AG ATGCT AGAGAT ATCTGGT ACCTTGCT GGAAT AGTG 
AGCT GGGG AGAT GAATGTGCGAAACC C AACAAGCCT GGTGT T T AT ACT AG AGT T ACGGCCTT GC GGGACT G 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGT GTGGAGGCCATT TTT AGAGAT AC AG AATT GGAG AAGACT T GC AAAAC AGCT AG ATTTGACT GATCTC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACT CT GT CAT CTGTGAGCAAT AGTTGAAACT TT AT GT AC AT AG AG AAAT AG AT AATACAAT ATT AC 
AT TAC AGCCTGT AT T C ATTT GTT CT C T AGAAGTT T T GTCAGAAT TT TG ACT T GTTG AC AT AAAT T T GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAAC AAGG AGTGAAAG AAAAT ATAAG AAGAAAAAAATCCCCT ACAT TTT ATT GGC ACAGAA 
AAGT ATTAGGT GTT TTTCTT AGTGGAAT ATT AGAAATG ATCAT ATT C ATT AT GAAAGGTC AAGC AAAG AC A 
GCAGAATACC AATCACT TCATC AT T T AGGAAGT ATGGG AACT AAGT T AAG GAAGTCCAGAAAGAAGCC AAG 
AT AT ATCCT TAT TT TC ATTT CCAAAC AACT AC TAT GAT AAAT GT GAAG AAGATT CTGTTTTT TT GTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDWRARKRVCWEPWVI GLVI FI SLIVLAVCI GLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AE FGREASNNFTEMS QRLESMVKNAFYKS PLREE FVKSQVIKFSQQKHGVLAHMLL I CRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRGGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAGSLEGKT DACQGDSGG PLVS SDARD I W YLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGG 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT G T TCTCACT T TAAC T GCAGAAACGAGGGTAGAGGAAGCAGT CAT T T TGACT TACT T TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACT GGCT TGTGGCG T T T GGACATATGAACAGGAACT TAT GGT TCCAGTACAAT GGT CA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAATGACAGAGAT GGACTGGCCCC CAGAT TCCTGCTGTGT T AGAGAAT TC CCAGGATGTT CCAAA 
CAGGCCCACCAGGAAGATCTCAGT GACCTTTATCAAGAGGGTT GTGGGAAGAAAAT GTATTCCTT 
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCC T T GAAGAATGACAAC TC T CAGCACC TGT CATG T CCCT CAGTAGAACT 
G T T GAAACC AAGCC TGTCAAGAATC T T T GAACACACATCCATGGCAAACAGC TT T AAT AC ACACT 
T TGAGAT GGAGGAG T T ATAAA AAGAAATGTCACAGAAGAAAAC CACAAAC T TGT T TTAT T GGACT 
TGT GAAT TTT TGAGTACATACTATGTGTTTCAGAAATAT GTAGAAATAAAAATGTTGCCATAAAA 
TAACACC T AAGCATATAC TAT T C T ATGCT T TAAAATGAGGATGGAAAAGT T TCATGT CATAAGTC 
ACCAC C T GGACAATAAT TGAT GCCCTTAAAAT GCT GAAGACAGATG TCATACCCACT GTGTAGCC 
TGT G TAT GAC T TTTACTGAACACAGT TAT GT T T TGAGGCAGCATGGT TTGATTAGCAT TTCCGCA 
TCCATGCAAACGAGTCACATAT GGTGGGACTGGAGCCAT AGTAAAGGTTGAT T TACT T CTACCAA 
C TAG T AT ATAAAGTAC TAAT T AAAT GCTAACAT AGGAAGT TAGAAAATAC TAAT AAC T TT TAT TA 
C TCAGCGATCTAT TCT TC T GAT GCTAAATAAAT TATATATCAGAAAAC TTT CAATAT T GGT GAC T 
ACCTAAATGTGATT T T TGCTGGTTACTAAAATATTCT TACCACTTAAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGTT C T GAT AAT G T TAAGAAT AACCAT TATGAAAAGGAAAAT T T GTCC TGTAT AGCATCAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T GT TATT TAAATAC T TAACCAC TAAT TT TGAAAATTACCAGTGTGATACATAGGAAT CAT TATTC 
AGAAT GT AGTC T GGT C TT TAGGAAG TAT TAATAAGAAAAT T TGCACATAACT TAG T TGAT TCAGA 
AAGGACT TGTAT GC T GT T T T TC TCCCAAAT GAAGACTCT T T TT GACAC TAAACACTT T T T AAAAA 
GCT TATCT T T GCCT TC TCCAAACAAGAAGCAATAGT CTCCAAGTCAATATAAAT TCTACAGAAAA 
TAGTGTT C TTT T TC TC CAGAAAAAT GC T TGTGAGAAT CAT T AAAACATGT GACAATT T AGAGAT T 
CTT TGTT T TATT TCACTGATTAATATACTGTGGCAAATTACACAGATTATTAAATTTTT TTACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
TTCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATAT T TTCTAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWLMSISVL^ 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGWTYEQEIMVPVQWS DMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLGIS IGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TC T C GTGAGAGC CT AGAGGCCTT AAAAAAAAAAGT GCT T GAAAGAGAAG GGGAC AAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC AC T GACC ACCAGTT GGAT GAGTCT CAAGGTCCT CCT CT AT GT GACAACC AT GT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGAT GCC AT GCCT ACCT TCT GCATACCAGAAAACCACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AGC T GCT C AG AT AC C AGC GAGT GCAC AT GC GC T C C AGGAACT GT GC T AGGC CCT GACAG GC AGACAT GCT T T G 
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
GAGTGT GGGGT TGGCCGTGT GCT AAGAAGT GAT G GC AAGACT T GT GAAGACGTT GAAGGAT GCC ACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CT GGT T GGTGGCCT GGAGC T CT TCCT GAC C AAC AC CT CCT GCC GAGGAGTGT CC AAC GGC AC C C AT GT C AAC AT 
CCT CT TCT CT CTCAAGAC AT GTGGTACAGT GGTCGAT GTGGT GAAT GAC AAGAT T GT GGC CAGC AACC TCGT GA 
f'\ CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
jjl ACC T GC GAGT T TCC AC GCCT GTAC ACC AT T TC T GAAGGAT AC GT TC C C AACC TT C GAAAC TCC C C ACT GGAAAT 

C'j CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 

U| GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 

f* % 

r5 5 



m 

\ * I 

iff 



• " 5 



TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGTCT AC AGGGC CAGAC GCT AAC AGGC GGCCC GATC CGC AT C GAC T GGGAGGACTAGT TCGT AGCC AT AC CT C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT C AG AAGC T GGGTAT AAT AT TT C AAGTT AC AAAC C CT AGAAAAAT T AAA C AGT T AC T GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAATTT GGAAGT GTATCAAT AAAACAGTATAT AATTT T 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHWILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CE FP RL YT I SEG Y VPNLRNS PLE IMSRNHG I FPFTL E I FKDNE FE E P Y REAL P T L KLRDSL Y FG I E PWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein : 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins . 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGOSTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCGT 
CCT GCT GCCTTGGGGTGACAAT CT CAGCTCCAGGCT ACAGGGAGACCG GGAGGAT C AC AG AGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGT CCT CAT CAAGGT GATT CT GG AT AAAT ACT ACTTCCT CT GCGGGC AGCCT C TCC ACT TC ATCC CGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CC CGAAGGGCCT GCAGTGGC AGTCCGCCT CTCCAAGGACCGAT CC ACACT GCAGGTGCTGGACT CGGC C AC 
AGGGAACT GGT T CT CTGCCT GTTT CGAC AACT TC AC AG AAGCTCTCGCTG AGAC AGCC TGT AGGC AGATGG 
f'J\ GCT AC AGC AGAGCT GTGGAGATTGGCCCAG ACCAGG AT CTGGAT GTTGTT GAAATC AC AGAAAACAGC C AG 

f|] GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 

■ .- 

i »■ 

LlJ TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

TC AGCATCCAGT AC GACAAACAGC ACGTCT GT GGAGGG AGCATCCTGG ACCC C CACTGGGTCCT CACGGC A 

t 

\ \ GC CCACTGCT TC AGGAAACATACC GATGTGTT CAAC TGGAAGGT GCGGGC AGGC T C AG AC AAAC T GGGC AG 

CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GAT GTCTG ACAT ACT GCTGC AGGCGT C AGTCC AGGTCATTGACAGC AC ACGGTGC AAT GC AGACGATGCGT 
\ i \ ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
CI GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
fll CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 

GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TT CTT GGAG C AGCAAAGGGC CT CAATTCCTGT AAGAGACCCT CGC AGCCCAGAGGCGCCC AGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTAT T GCTAAGCC AAGAAGGAACTTTCCC AC ACT ACT GAATGG AAGC AGGCTGTCT T GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GC CT ACTAGAGCAAGAAACC AGTT GT AATATAAAATGCACTGCC C T ACTGTT GGT ATGACT ACC GT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 



V < W 
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MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGI PI I IALLSLAS I I I WVLIKVILDKYYFLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEAS VDS WPWQVS I QYDKQHVCGGS I LDPHWVLTAAHCFRKHT DVFNWKVRAG SDKL 
GS FPSLAVAKI 1 1 IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMS DI LLQASVQVI DS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDS GGPLMYQS 
DQffiH WG I VS WG YGCGGP S TPGVYTKVS AYLNW I YNVWKAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
%1 CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
r% AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

WSJ? 

fll GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 

r f 

LlJ ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

CAGAAGGGGT CTGC TTAGAC CACCTGGTT TATGTGAC AGGAC T TGCAT T C T CCT GGAACATGAGG 
GAACGCCGGAGGAAAGCAAAG T GGCAGGGAAGGAACT TGTGCCAAAT TATGGGT CAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGGAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGAGAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C TCAATT TAAATCATGT TCTAGTAAT TGGAGCTGTCC CCAAGACCAAAGGAGCTAGAGCT TGGT T 
CAAAT GATCTCCAAGGGCCCT TAT AC CCCAGGAGACT T T GAT T TGAAT T TGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGCTACT CGGGAGGCTGAGACAGGAGAAT TACT TGAACCT GGGAGGT GAAGGAGGCTGAGACA 
GGAGAAT CACT TCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 
TGGT TAT TTGTAA 
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MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 



CAGCAG T GG'1'CTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACT G TGTGCTGAGAGACCATGGCAA 
AGAAT CCTCCAGAGAATTGT GAAGAC TGTCACATTCTAAAT GCAGAAGCT T TTAAA.TC CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G T T T T GGGGGAGCAAGCAC T TC TGGC CGGAGGTAC CC AAAAAAGCC TATGACATGGAGCACACT T 
TCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACT GAT GAAACAT TGGAAGTGCACGACT T TAAAAAC GGATACACTGG 
CAT C TAC T T CGTGGGTCT TCAAAAAT GT T TTAT CAAAAC TCAGATTAAAG T GATTCCT GAAT TTT 
C T GAACCAGAAGAGGAAATAGATGAGAATGAAGAAAT T ACCACAAC T T TC TT TGAACAGT CAGTG 
AT TT GGGT CCCAGCAGAAAAGCCTATTGAAAACCGAGAT T TT C T TAAAAAT TCCAAAAT TCTGGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
AC TTT GAG G AGGAG GGAGAAG ATC T T CAC TTTCCTGC CAAC GAAAAAAAAGGGAT T GAAC AAAAT 
GAACAGT GGG T GGT CCCTCAAGTGAAAGTAGAGAAGACC CGT CACGCCAGACAAGC AAGT GAGGA 
AGAACT T CCAATAAATGACT AT ACTGAAAATGGAATAGAATT T GAT CCCATGCT GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MAKNPPENCEDCHI LNAEAFKSKKI CKSLKI CGLVFG I LALTL I VLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 

EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites . 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCT C CCCT CAGGAGCGCGTT AGCT TC AC AC CT T C GGCAGC AGGAGGGCGGCAGCTT CTCGCAGGCGGC A 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCC GGCTGCAT CGCGGCCACCGGGATGGACATGT GGAGCAC CC AGGACCTGTACG AC AACCCC GTC ACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTT C ACCATCCT GGGACT T CCAGCC ATGCT GC AGGCAGTGCGAGCCCTGAT GATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
; t TC TGTGTT TGCCAAC ATGCT GGT GACTAACTT CT GG AT GTCC ACAGCT AACAT GT AC ACCGGCAT GGGTGG 

'f* % GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
f*1 TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
fll GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
Uj TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
Ul CT TCC AAGCACGAC T ATGTGTAAT GCTCT AAGACCTCTCAG CACGGGCGG AAGAAACT C CCGGAGAGCTCA 

ff CC CAAAAAAC AAGGAGATCC CAT C T AGATTTCT T CTTGCTT T TGACTC AC AGCT GGAAGTTAGAAAAGCCT 

**** CGATT T CAT CTT TGGAGAGGCCAAAT GGT CTT AGCCTC AGT C T CTGTCTCT AAAT ATTCCAC CAT AAAACA 

% n GCTGAGTT AT TT AT GAATTAGAGGCT AT AGCT CAC ATTTT CAAT CCTCTAT T TC TTTTTT TAAAT AT AACT 

TTCTACTCTGAT GAGAGAATGTGGTTTTAATCTCTCTCTCACAT TTTGATGATT TAGACAGACT CCCCCTC 
TT CCT CCT AGTC AAT AAACCCATT GATG ATCT ATTTCCCAGC TT ATCCCC AAGAAAAC TTTT GAAAGG AAA 



11 

r - * " 



! , I GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

Sal 4 

P ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
!lj ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT T GTCCC T AGATG AAT GAGAAAATT AT TTTT TT T AATT TAAGTCCTAAAT AT AGTT AA 
AATAAATAAT GT TT T AGT AAAATGAT AC ACT ATCT CTGTG AAAT AGCCTC AC CCCT AC ATGT GGAT AGAAG 
GAAATGAAAAAATAATTG CT T T GAC ATTG TCT AT AT GGT ACT TT GT AAAGT CAT GCTT AAGT ACAAATTCC 
AT GAAAAGCTCACACCTGTAAT CC T AGC ACTT T GGG AGGCT GAGGAGGAAGGAT C ACT TG AGCC C AGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
T GGT GGCAT ACACC T GT AGT CCCAGC ATT CCGGGAGGCTG AGGT GGGAGGAT CACTTGAGCC C AGGGAGGT 
TGGGGCTGCAGT GAGCC ATGAT CAC ACCACTGCACT CCAGCC AGGTGAC AT AGCGAGATCCT GT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 

MS T TT CQWAFLLS I LGLAGCI AATGMDMWS TQDL YDNPVT SVFQYEGLWRSCVRQS SGFTECRP 
YFT ILGLPAMLQAVRALMIVGIVLGAI GLLVS I FALKCIRI GSMEDSAKANMTLTSGIMFI VSGL 
CAI AGVSVFANMLVTNFWMS TANMYT GMGGMVQT VQTRYT FGAAL FVGWVAGGLTL I GGVMMC IA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTGATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
AC T CAAT AG T GAATG T T GCC C AAAAAC GT GAGC TT G GAGAAG CT C T CTACT T AGGAT GGAC CACG GC A 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACT CGAT ACCT T CCCAT CGCACAACCCAAAAAAGT TATCACAC CGGAAAGAAGT CACCGAGCG 
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CTTAATTACAGGAACTGTGCATCAGCTATTTAT GATT CTATAAGCTATTTCAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCAT TATGTACAT AGATGAGT GTAACAT TTAT ATCTCACAT AGAGAC ATGC TTATAT GGT 
TTTATTTAAAATGAAATGGCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
AT CAT GGAT AGGGTT GAAGAAGGTT AC TAT T AATT GT TT AAAAACAGC T TAG G GAT TAAT GTCC T CCA 
T T T ATAATGAAGATTAAAATGAAGGCT TTAAT CAGCATT GTAAAGGAAATTGAAT GGCT TTCTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CT T GTGTAT TAAATTAACATTTTTAAAAC GC AGATATTT T GTCAAGGGGCTTT GC ATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT TT TTGTTTTT GTATTTGAAGAAGAATGAT GCAT TTTGACAAGAAAT CAT ATATGTAT GGAT 
ATAT T TT AATAAGTAT T T GAG TACAGAC T T T GAGGT TTC ATCAAT ATAAATAAAAGAGC AGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAAC^CAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATTTATAT TAATAAATTGTACATT T T TCTAATT 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
IYDSLIALSPDLQAARGLMCAASVMSFLAO 

MVVLIPVSWVAMAI IRDFYNS I VNVAQKRELGEALYLGWTTALVL I VGGALFCCVFCCNEKSS S Y 
RYS I PSHRTTQKS YHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 



Transmembrane domains: 

«q amino acids 82-101, 118-145, 164-188 



.S : 



fll 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT G TCTGAGGGAAAGCT T T GAGGAGTCC TGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAAT TAATATT CATCG 
CACT T CT TCTGTGGAAGGACT T TGT GAAGGAAT TGGT GC TGGAT TAGT GGATG T T GC T AT CT GGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
TTGGAATGGT TCAC TTAAATGACATT TTAAATAAGT T TATGTATACATCTGAATGAAAAGCAAAG 
C TAAATATGT T TACAGAC CAAAGT G T GAT T TCACACT GT T TT TAAATCTAGCAT TAT T CAT T TTG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAAT CTTT GTACAATTTGTAAATGTTAAGAATTTT TT TTATATCTGT 
TAAATAAAAATTATTTCCAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVT PGTPGI PGRDGFKGEKGECLRE S FEE S WTPNYKQCS WS S LNYG I DLGKIAECT FTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

^! Transmembrane domain: 

m amino acids 195-217 

m 

•*« ■■ - 

.ft : 
■-AS 

a 

j3»# ■ _ 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCT.GCCTTCTGGGCTCCflACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCAT GT T T T T C GATAAGAAGAAAT TGTAGGAT CCAGT TTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TAT TACCTAGGAAGAT T TTGAT GT T TTGC TGCGAAT GCGG T G T T GGGAT T TAT T TGT T CT TGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCT TTGAACAATACAAAGGATGGGT TT CAATG 
TAAT TAGGCTACTGAGCGGATCAGC TG TAGCACTGG TTATAGC CCCCAC TGTCT TACT GACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
AT CTCAGAAAT TACAGGAGATACCC TCAAGTATATC TGC TGGT TGC T TAGGTT T GT CC CT TCGC T 
ATAACAGCCT TCAAAAAC T TAAGTATAAT CAAT T TAAAGGGCT CAACCAGCTCACCTGGC TAT AC 
CT TGACCAT AACCATAT CAGCAATATTGACGAAAAT GCT T T TAATGGAATACGCAGACTCAAAGA 
GCT GAT T CT TAGT TCCAATAGAAT CTCC TATT T TC T TAACAAT ACCT TCAGACC TGTGACAAAT T 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCT T T GC TGGCATGATCAGACT CAAAGAACT T CACCTGGAGCACAAT CAAT TT T CCAAGCT CAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CAT AGGACAGACCATGT CCT GGACC T GGAGC TC CT TACAAAGGC T TGAT T TAT CAGGCAAT GAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCT CACAT TTAT TGGT CAAGAGAT T TT GGAT TCT T GGATAT CCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGT CTAAGGGAGAAT ACAAT TAT C TGT GCCAGTCCCAAAGAGCTGCAAGGAGTAAATGT G 
ATCGATGCAG TGAAGAAC TACAGCAT CT G TGGCAAAAGT ACTACAGAGAGGTT T GAT C TGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGC TGGT TAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAG T CCCTAAAGCAAAT GACT CCCAGCACCCAGGAAT T TT ATGTA 
GAT TATAAACCCAC CAACAC GGAGAC CAGCGAGATGCTG C T GAATGGGACGGGACC CT GCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT T T TCC TC TCATACATAAT CAACCCATTGAAATT TAAATACCACAATCAAT GT GAAGC T T 
GAACTCCGGTT TAATATAAT ACCTAT T GTATAAGACCCT T T AC TGAT TCCATTAAT GT CGCATT T 
G T T T T AAGATAAAACT T CT T TCAT AGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVI RLLSGSAVALVI APTVLLTMLS SAERGCPKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
L S LRYNS LQKLKYNQFKGLNQLT WLYLDHNH I SN I DENAFNGIRRLKEL I LS SNRI S YFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
IARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWEGSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPEXDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNfETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12* 

CCGT1ATCGTCTTGCGCTACTGCTG AATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTGCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC T T TCTAAAGCT T TGGCAAGGAGTGACACCCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAAT CCAACTGAC CTAGTGAAGGTTCAGATGCAAAT GGAAGGAAAAAGGAAAC TGGAAGG 
AAAACCATT GCGAT T TCG TGGT GT ACAT CATGCAT T T GCAAAAATC T TAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACT T AT GATACAG TGAAACAC TACT TGGTAT T GAATACAC CAC T TGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAAT CAAC CACGAGATAAACAAGGAAGGGGAC T T T TGTATAAAT CATCG 
AC T GAC T GCT T GAT T C AG GC T G T T CAAGG T GAAGGAT T CAT GAG T C T AT AT AAAGGC T T T T T AC C 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAK I LAEGG I RGLWAGW VP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLS'SLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 

CAAATGAAGAGGACATCC TCTCCATGTTC TCAGC TAT CCG T TC TCAGCACAGCGGTG TAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
M GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 

AGGAGCGGAAT GTGGACGAT GGGCACATCAT TAACATCAAT AGCAT GTCTGGCCACCGAGTGT TA 
Z CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

*> » 

V t 

V • 

if GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
f?1 AGACACAAT T CGCCT TCAAAC T CCACGACAAGGAC CCTGAGAAGGCAGC TGCCACCTATGAGCAA 

ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGAT TGGAGACATCCAGATGAGGCCCAC GGAGCAGGTGACC TAG TGACT GT GGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACT TCCTGTCCACACCCCGACCAGGGGCTAGAAAAT T T GT T T GAGAT TT T TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATT TCATGGTGAT CATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 



his 
* * 



: 5 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGI GAAVARALV^ I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPOT^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTXCTAC ATG GGCCl'CCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 

CAT GTCCCT CCCACCCTCCTTT GACTGCGGGCCGT TCAGGT GCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTTTT AACTGATAGTT GT ACAT AT TTGGGGGTACATGTGATAT TTGGATACATGTATACAA 
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACAT T TAT TTTT TAT TCT TT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAAT GCCCAAC TAAT TT TT GTAT TT T TAGTAGAGACGGGG TT TTGCCATGT TGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCT CT GAT CACC TCAT TC TACT C T CTACC TCCATGAGATCCACT T TT T TAGC TC CCACATGT G 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TC CAT C CAT GTTGC TGCAAATGACAGGATT T CGT TCT TAATT TCAATTAAAAT AAGCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRS FCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site* 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lai 



< * i 



TT CT GAAGTAACGGAAGCT ACCTT GT AT AAAGACCTCAAC ACTGCTGACCATOATC AGCGCAGCCT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGAT GCTACAACTCTCTACCTTCAGAACAACCAAAT AAAT AATGCTGGGATT CCTTCAGAT 
T T GAAAAACT TGCT GAAAGT AGAAAG AAT ATACCT ATACC AC AAC AGTTT AGAT GAATTTCCT ACC AACCT 
CCCAAAGT AT GT AAAAGAGT TACATTTGCAAGAAAATAAC AT AAGG ACTATC ACT T AT GATT CACTTTCAA 
AAAT TCCCTAT C TGGAAG AATT AC AT TT AG AT GAC AACTCTGTCT CTGCAGT TAGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCC AGGACT AT AGAAGAACTACGCTTGGATGAT AATCGC AT AT CC ACTATT TC ATCACCAT CT CTTC AAG 
GT CT C ACT AGTCTAAAACGCCTGGTTCT AG AT GGAAACCTGT TGAAC AATCATGGT TT AGGT GAC AAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 



I i \ TCCAGG C ACAAACC TG AGGAAGCT TT ATCT T C AAG AT AAC CACAT C AATCGGGT GCCC CCAAAT GCTTTTT 



CT TAT C TAAGGC AGCT CT AT CGACTGGAT ATGTC CAAT AATAAC CT AAGT AATT T ACC TCAGGGT ATCTTT 



"'*f GATGAT TT GGAC AAT ATAAC AC AACTGATT CT TC GC AAC AAT CC CTGGTATT GC GGGTGCAAGAT G AAATG 

h M GGTACGTG ACTGGT TACAAT CACT ACCTGT GAAGGTCAACG T GCGTGGGCTC AT GTGC CAAGCCCC AGAAA 

^ ^ AGGT T C GTGGGAT GGCTATT AAGGATCTCAAT GCAGAACTGT TT GATTGT AAGGAC AGTGGGAT T GT AAGC 

**A ACCAT T CAGATAAC C ACTGCAATACCCAAC ACAGTGTATCC T GCCC AAGGAC AGT GGCCAGCTCCAGTGAC 

CAAAC AGCCAGAT ATT AAGAACCC CAAGCT CACT AAGG ATCAAC AAACCACAGGGAGT CCCT CAAGAAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
ACTGCT TTGAGACT C AGCTGGCT TAAACTGGGCC AT AGCCC GGC ATTTGGAT CT AT AACAGAAAC AATTGT 



/|j AACAGGGGAACGCAGTGAGT ACTT GGTCACAGCCCTGGAGCCTGAT TC ACCCTAT AAAGTAT GC AT GGTTC 

CC AT GG AAACCAGC AACCTCTACCT ATTTG AT GAAACTCCTGTT T GT ATT GAGACT GAAACT GC AC CCCTT 
CGAATGT ACAACCCT ACAACCACCCTCAAT CGAGAGCAAG AGAAAG AACCTT AC AAAAACCCCAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AATGGATCGCTCTTCT CAAGGAAC TGT GC AT ATAGC AAAGGG AGGAGAAG AAAGGATG ACTAT 
GC AGAAGCT GGC ACT AAGAAGGACAACTCT AT CCTGGAAATC AGGGAAACTT CT TT TC AG AT GT T ACC AAT 
AAGC AATG AACCCATCTCGAAGGAGGAGTT TG TAAT AC ACAC CAT ATTTCCT CCT AATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TC AC AC TC ATGAT GCTGAAGGACT CACAGCAGACTTGT GTTT TGGGTTTT TT AAACCT AAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVS AVS IEEGAFRDSNYLRLL FLS RNHLST I PWGLPRT IEELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVWLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAE L FDCKDS G I VS T I Q I T T AI PNT VYPAQGQfflPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 13 3 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
T GCAGAGGCAGTCTGGGC T TGGCCAGAGC TCAGGG TGCTGAGCGTG T GAC CAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C TCCACTACCTCAAGC T GAGT GATCCCAAGTACCTAAGAGAGTTCCAGCT GACCC TCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCAC TCCAACAGAGCCAGGAC TATATCAACC TC T T C T GCGCCAACATGATGGACT T GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCT GAT GCTGAAGAT GAAGAAT TAT CTAAAGCTAT T CAA 
TAT CAGCAGCATT T T TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T TCCAGAT TCTCGCTC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGC TGAGACGGACGGT X CCAT GCCAGCT GCC TGGAGGAGGAACAGACCCC 
T TT AGTC CT CATCC C T TAGATC CT GGAGGGCACGGAT CACAT CC TGGGAAGAAGGC AT CT GGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERA.TVFLEQRLPEINLDGMVGWVLEEQLKSWEKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 



Transmembrane domain: 

*M amino acids 39-56 



i'ri Tyrosine kinase phosphorylation sites . 



amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site, 
amino acids 10-14 

Glycoprotein hormones beta chain signature 1 

amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGT TT TGTGGT TCAT GATATACCT TC T GGATCT TATGTAGTGGAAG TTGTATC TCC 
AGCT TACAGAT T TGATCCCGT T CGAGTGGATAT CACTTC GAAAGGAAAAATGAGAGCAAGATATG 
T GAAT TACATCAAAACATCAGAGGT T GT CAGAC TGCCCTATCCTCTCCAAATGAAATCT TCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TAT GATGATGGT TC T TCC T TTAT T GATAT T TGTGC T T CT GCCTAAAGTGGTCAACACAAGTGAT C 
C TGACAT GAGACGGGAAATGGAGCAGTCAAT GAATAT GCTGAATTCCAACCATGAGT T GC C TGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATAT TGATG TCACTGAAT T AAT TACAGTG T CC TATAGAAAATGCCAT TAAT AAAT TAT 
ATGAACTACTATACAT TAT GTATATT AATTAAAACAT CT TAAT CCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKT DGS FWHD I PS GS YWEWS PAYRFDPVRVD I T SKGKMRARYVNYI KTS E 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMHV^ 
QSMNMLNSNHEL PDVSE FMTRL FS SKS SGKS S S GS SKTGKS GAGKRR 



Important features of the protein: 
Signal sequence: 

W* amino acids 1-23 



Transmembrane domain: 

r 

amino acids 161-182 

•sj N-glycosylation site* 

W amino acids 184-187 



Glycosaminoglycan attachment sites, 

amino acids. 37-40, 236-239 

cAMP- and cOdP-dependent protein kinase phosphorylation site 

amino acids 151-154 



N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGC TACGT TGGC TT T CTGGAAGGGGAGGCT ATATGCG TCAATT CCCC AAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
C T TAC CT GCTGGGCAC TAACGGCGGAGCCAGGAT GGGGACAGAATAAAGGAGCC ACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
f*jb AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
C) TGCAAGCCAAAGATGGAAACAT TGACATCAGAAT C T T AAGGAGGACTGAGTCT T TGCAAGACACA 

M AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 

.2S ?- 

• Si 

H| AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 

111 CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 

%l AAGAAATACAGCCAGAT T CTGAGT CACT T TGAAAAGC TGGAACCTCAGGCAGCAG TTGT GAAGGC 

£1 TTT GGGGGAACTAGACATTCT T C T GCAAT GGATGGAGGAGACAGAATAGGAGGAAAGT GAT GC TG 

* C TGC TAAGAATAT TCGAGG T CAAGAGC TC CAGT CTTCAATACC TGCAGAGGAGGCAT GACCCCAA 
\l\ ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
Q CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
l|| TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
•K AT GT AT T TAT T T T T T TAG T T GGACAT GAAAC T T TAAAAAAAT T CACAGAT TATAT T TAT AACC T G 

n \ 

ACTAGAGCAGGT GAT G TAT TT T TATACAGTAAAAAAAAAAAACCT T GTAAAT TCTAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCT TACAAT TACTGACCATCCCCAGTAGACTC CCCAGT CCCATAAT TGT G TAT CTTCCAG 
C CAG GAAT C C T ACACG GC CAG CAT G TAT T T C TACAAAT AAAGT T T T C T T T GC AT ACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 15*8 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKJ^KASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANS FLT IKKDLRLSHAHMTCHCGEEAMKKYS QI LSH FEKLE PQAAWKALGE LDI LLQWMEET 
E 

Important features of the protein: 
Signal peptide : 

amino acids 1-42 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

US J 

111 amino acids 192-195, 225-228 



N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T T TG TGGC TGAAGCAAAGCCACC T ACAGT TACTATGC CT C GAATCAAGGCAT TGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA 
T TCCCAAAGCAT TT TGT GGAATAT GATGGAACAACCT CAT T T T TCG TGTCAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
CI GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
Q GGGCCTC CT CCACCGAGGAGGT GAATACACCTC TGAT T T TAAT CAGT T C TGCG T T TGAAAGGAAA 

ff| CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
f! J ACTTGGCTT ACCGAT TCCAAAAGACAGTGTAGGGAGCCTCCTAT TCCCAGTTGT GGAAGGAAGAC 

CAAT GAGAGAGCAG T TGAGAT T TT TACAT T TGAATACAGT GCAGCT TAGTAAAC TGT T GCAAGAG 
yj AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
%l GAAC T GGAT CAGAC T G T AC T T GGAG GAAAAGCAT T CAG AAG TC C TAT T CAAC CTGGGCTC CAAGG 

Ji TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
■J" TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 

CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
Q CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
tfl GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
f*\ AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
T*\ TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
% CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
'M TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
f|j CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 

GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACT GCAGTGGCAGCAGGACT GTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCT TGCACCTCGGGCCCATCTGGGC TCATGC TCTCTCTCCTGCTATTGAAT TAGTACCTAG 
C TGCACACAG TATGTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
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FIGURE 14Q 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFV-FGSKGVKFMPYTTYLTOKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGAS S T EE VNTPL ILIS SAFE RKPGD I RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 



N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 




Transmembrane domain: 



amino acids 58-76 



N-glycosylation sites. 

amino acids 56-60, 194-198 



Amidation site. 



amino acids 154-158 



Cell attachment sequence. 



amino acids 205-208 
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FIGURE 141 
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GGCACGAGGCAAGC CTTCCAGGT TAT CGT GACGCACCT T GAAAG TC TGAGAGCTACTGCC CTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACT GATG TTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAAT TCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCT GGGAAT CTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTT CTTT GCATTAGCCTCATCCTT GAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGAT AAAGGACAAAGT CATCCATCCCT T CAGC T GAAGAAGGAGAAACT GATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCT GGAGTCGGCGGCT CACCCCGGATGGT TCATCTGCACCTCCTGCAAT T GT AATGAGCCT 
GTT GGGGTGACAGATAAAT TTGAGAACAGGAAACACATTGAAT TTTCATTTCAACCAGTT TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 



IjjJ ATT TGAACTAATTGTATAAAAACACCAAACC TGCTCACT 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKT THT FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNY IRPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGAT ggcagatgagtgact ccacatccagagctgcctc cc t t taa 

TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAAC TAGGTGTGGGAAAT T TGAAGACGACAT T GACAAC TGCCAT T T CCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACAC AT C T T GAG C C T AAT CATG T AG T G T AGAT C AT T AAAC AT C AGC AT T T T AAG AAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



\ '* 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQ 

YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 



AppJD=10063579 



Page 296 



FIGURE iar 



C TGTGCAGCTCGAGGCTCCAGAGGCACAC TCCAGAGAGAGC CAAGG T TCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACA.GAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAG T T C CAGAAGCCAGACAAC AAGCTCCACCAGCAGG T GCT CTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCGTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCT TGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCC TGGCAAG T GACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCAGGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAAC ACATCAGGCACT GCGCCACC TGC T TCACAGTACT T CCCAACAACT C T TAGAGGTAG 
G TGTATT CCCGTT TTACAGATAAGGAAAC TGAGGCCCAGAGAGCTGAAGTAC T GCACC CAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CTCTAATGAAATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPS TAQI TEAQVAENRPGAFIKQGRK 
LD I D FGAE GNRY YEANYWQ FPDG I HYNGC S EANVTKE AFVT GC I NAT QAANQGE FQKP DNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 
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Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site 



amino acids 76-83 

s i 



N-myristoylation sites . 

amino acids 71-77, 88-94/ 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTT TCAGAT T CAGGGGGTACATGTGAAGGT TTG T T T TATGAGTATAT TGCATGATGCT GAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCGCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTGCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGT GCAGAAAGACCTTT GAAT GAACAGAT TGCTGAA 
G CAGAAGAAGAC AAGAT T AAAAAAACAT AT C C T CCAGAAAACAAGCCAGGTCAGAG CAACT AT T C TT T 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGC TCCCCACT T GATAATAAGTTGAATGTGGAAGATGT TGATTCAACCAAGAAT CGAAAACTG 
ATC GATGATTAT GACT CTACTAAGAGT GGATTGGAT CATAAAT TT CAAGATGATCCAGATGGTCTTCA 
T CAAC TAGACGG GACT C C T T T AACCGC T GAAGACAT T GT C CATAAAAT C GCT GCCAGGATT TAT GAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGT T TTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTAT GAGGAGGAT CCCAATAAGCCCACAAGCTGGACTGAGAAT CAGGCT GGAAAAATACCAGAGAAAG 
T GACTCCAAT GGCAGC AAT T C AAGAT GGT CT T G CT AAGG GAGAAAACGAT GAAACAGTAT CTAACAC A 
TTAACCTT GACAAATGGCTTGGAAAGGAGAACTAAAACC TACAGTGAAGACAACT T TGAGGAACTCCA 
ATATTTCCCAAAT TTC T ATGC GCTACTGAAAAGTAT TGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATT ACTAT CATGAAAACACTGATT GACTTTGTGAAGATGATGGT GAAATAT GGAACAATATCT 
CCAGAAGAAGGT GTTTC CTACCT T GAAAACT TGGAT GAAATGATTGCTC TTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAG T ACCAAGGAAGAAGCAGCTAAGAT G GAAAAGGAATAT G GAAG C T T G AA.G G AT TCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAzyiCAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAAT AT T GAAT G GT T GAAGAAACATGACAAAA&G GGAAAT AAAGAAG AT TAT GACC TT T 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGC C ATC AAGCGC AT T T AT AGC AGC C T GTAAAAAT GG CAAAAGAT CCAGGAGT CT T T CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAAT TCTGTGATTAAAATTTTTT GACCCAAGG 
G TT AT TAGAAAG T GCT GAAT T T ACAGT AG T T AACC T T TTACAAGT G G T T AAAACAT AGC TTTCTTCCC 
GTAAAAA.CTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ifso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRSLSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKL I DDYDS TKS GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKL I SKE ANNYEEDPNKPT S W TENQAGK I PEKVT PMAAI QDGLAKGENDET VSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTL IDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myri s toyl a ti on si te s : 

amino acids 143-148, 239-244 
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FIGURE isi 



CGGCTCGAGGCTCCCGCCAGGAGAMGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AAT AAC C AGC T T C TAG C T G GAGGG C T GCAT G CAG GGAAGG T CAT TAAAGG T GAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCAGC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAGG GCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
5? TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
Q GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
■ Q GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
'ffl TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
k'l GAAATGGCT CGAGC TCAGAAGATAAAAGATAAG TAGGGTATGCTGATCCTCT T T TAAAAACCCAA 

'fl GATACAATCAAAATCCCAGATGCTGGTCTCTAT TC CCAT GAAAAAG T GCTCAT GACATAT T GAGA 

Ll] AGACCT AG T TAC AAAGTG GC AT AT AT T GC AAT T TAT T T T AAT T AAAAGAT AC C T AT T TAT AT AT T 

\\ TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
L ffj AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 

ATTCCTTGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA 
1,. TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
Q TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
Ul GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
%% GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
f;1 ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 

GTCTCTGCAGATGTAGT TAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAAT TCAATAT 
CI GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
fjj AAGAT GAAGGCAGAGATCGGAGTT T T GCAGCCACAAGCT AAGAAACACCAAGGAT T GT GGCAACC 

ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAG TGATGGCAT GCCACT T CCAAGAT TAGGT TATAAAAGAC AC TGCAGC TTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC T AGCT GCCATGC TATGAGCAGGCC TATAAAGAGACT T AC G TGGTAAAAAATGAAGT C T CC T 
GCC CACAGCCACAT TAGT GAACCT AGAAGCAGAGACT CT GT GAGATAATC GATGT TTGTTGTTTT 
AAGT T G C T C AGT T T T GG T C T AAC T T GT TA T GCAG CAAT AGATAAATAAT AT GCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGPCVIKGEE I SWPNRWLDASLS PVILGVQGGS 
QCLSCGVGQEPTLTLEPWIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TDFYFQQCD 

N-myxistoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-63, 73-78, 91-96, 106-111 

Interleukin-1 signature . 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 



CTTCAGAACAGGTtCXCCTTCCCCAGTCACCAGTTGCTCGAG'ri'AGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGGACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGT TC GTCTCAT TGGGGAGAAACTGT T CCACGGAGTCAGTAT GAGTGAGCGC TGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGT GATGACCTGC AT AT CCAGAGGAATG T GCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGTGGAGAGATCAAAGGAAT T GGAGAACT GGAT TTGCTGTT TATGT C T C T GAGAAAT 
GCCTGCAT TTGACCAGAGCAAAGCTGAAAAAT GAATAAC TAAC CCCCTT TCCCT GCTAGAAATAA 
C AAT T AGATGCCCCAAAGCGAT TT T T T T TAACCAAAAGGAAGAT GGGAAGCCAAACTCCATCAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T TTCTAAGCATAGATAT TTAT TGAT AACAT T T CAT TG TAAC T GGTGTT C 
SA TAT AC ACAGAAAACAAT TTAT T T T T TAAATAAT TGTC T T T T TC CATAAAAAAGAT T AC T T TCCAT 

a} TCCT T TAGGGGAAAAAACCCC TAAATAGC T TCATGT T TC CATAATCAGTAC TT TAT AT T TAT AAA 

» T GT AT T T ATTAT TAT TATAAGACT GCAT T T TAT T TAT AT CAT T T TAT TAATAT GGAT T TAT T TAT 

AGAAACATCAT TCGATAT TGCTACT TGAG TGTAAGGCTAAT AT TGAT AT TTAT GACAATAAT TAT 
AGAGCTATAACATGT T TAT TTGACCTCAATAAACACTTGGATATCCC 
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FIGURE 1*4 

MAALQKSVSSFLMGTLATSCLLLLALL^ 

ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites, 

amino acids 54-58, -68-72, 97-101 

N-myristoylation sites . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAA.TAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAGTGCCCGACT TG TGAC T GAGTGTGCAG TGCCCAGCAT GTACCAGGTCAG TGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGG G AG C GAC C CAGAT T AGG T GAG GACAGT T C TC T CAT TAGC C T T T T CC TACAG GT GG T T GC AT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTGCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATC CCGGGACAGGATGGGGGGC TT TGGGGAAAACCTGCACT TC TGCACAT T TT GAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACAT GGAT GATTCTGAGGAGGAAGCTGTTAT TGAATG TATAGAGAT T TATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-14 0 

Tyrosine kinase phosphorylation site, 
amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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C CG G C G ATG T CG C T CG TGC T GC TAAGCC T GGC C GCG C T G T G CAG GAGC GC C G TAC C C C GAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACC TCCGAGTAGAACCTGTTACAACTAGTGT TGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T T GT GTGACGGGCAAAAGCAACT TCCAGT CCTACAGC T GT G TGAGGT GCAATT ACACAGAGG 
CCTTCGAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
C CC T T CCATGTC TGTGAAT TTCACC T CACCAGGCT GCCT AGAC CACATAATGAAAT AT AAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 

TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
Uf GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
% CAT GGGT GC TGGTGGCAGGGAT CT AT CTAATGTGGAGGCAC GAAAGGATCAAGAAGAC TT CC TT T 

TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAAT T TGT TACT TCACT GAAT T TC T TCAAAACCAT T GCAGAAG TGAGGT CATC CT T GAAA 
AGT GGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCT TGCCACTCAAAAGAAGGCA 
y GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
Ul GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GTGAT CT AAGAAGCCAGAT TCATCTGCACAAATACG T GGTGGT CTAC T T TAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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MSL VLL SL AALC RS AV P RE P T VQC GS ETG P S PEV3MLQ H DL I P GDLR DLRVE PVT T S VATG D Y S I LMNV S W V 
LRADASIRLLKATKICVTGKSNFQSY SCVRCNYT EAFQTQTRPSGGKWT FSY IG FPVELNTVY FIGAHNI P 
NANMNEDGPSMSVNE^SPGCLDHIMKYKKKCVKAGSLWDPKITACKKNEETVEVNFTTTPLGNRYMALIQH 
ST 1 1 G F SQV FE P HQKKQT RASWI PVTGDSEGAT VQLT PY FPTCGSDC IRHKGT WLC PQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 



M Important features of the protein: 

L 5 j Signal peptide: 



amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 



cAMP- and cGMP-dependent protein kinase phosphorylation sites 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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AGCCACCAGCGCAACATGACAGT GAAGACCCT GCAXGGC CCAGCCAI GGTCAAGTAC T TGCTGC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAAT TACACT GTCACT T GGGAC CCCAACCGGTACC CC TCGGAAG T T GTACAGGCCCAGTGTAGGA 
ACT T GGGC TGCATCAAT GCTCAAGGAAAGGAAGACAT CT CCAT GAAT TCCGT TCCCAT CCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACT CAGC T GAAGAAG 
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FIGURE 160 
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MTVKTLHGPAMVKYLLLSILGLAFLSE^U^KIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRYSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCT CVT PV I HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 



p 7 

rlj 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TCCGT GCTGGAAGT AGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCC TCC GC CC CC TCTGGAGGC TGAAGAGGGAT TC 

CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCT GACT CAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCT T CGAGGCTGC CC TAGGGAGTGAGGTACGAATC T GGTC CT ATACTCAGC CCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
Q ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
Q GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
f|i CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
'f t CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
■fj CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
|J; GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
%l TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
J% AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 

CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
G TGGCTGTACT TCACTACCCAGCAAAGCC TCCACGAGGGCAGC TCGCC T TGGAGAGTACTTAC TA 
□ CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
\fi CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
p CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
T\ CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 

CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
C'l CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
fjj CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 

GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 

GCGCCTCGCTCAGCTGCGTGCTGCeCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 

CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGACGCTGT T T T TCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLMjGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDEOTEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWAC PMDKY I H KRWALVWLACLL FAAALS LI LLLKKDHAKGWL RLLKQDVRSGAAARG 
RAALLLYS ADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAW FHAQRRQTLQEGGVWLLFS P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 



N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 



Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 




Signal sequence: 

amino acids 1-20 



Transmembrane domain. 



amino acids 453-475 
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FIGURE 163 



GGGAGGGCT CTG TGCCAGCCCC GATGAGGACGC T GCT GACCAT CT TGACTGTGGGATCCCTGGC'i.' 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCC CCACGCCAAT CCGT GCAGGCGAT GGCCACCGGCTAACCCT GGAAGACATC T TC CATGACCT G 
T TC TACCACT TAGAGC TCCAGGTCAACCGCACC TACCAAATGCACC T TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
U. CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
C TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTAGTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
O TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
31 CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
||| GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
Tu GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
] f\ CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
% CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
*JJ TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGATTTGCAC^GACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 
^ CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
ffj CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
U'l TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
Q GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
Ql AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
f\ ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
2* AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGGAGCATGATA 
\\l AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAAT GACAGTGCAAGGAGGAAATGCAGGGAAACTC CCGAGGTCCAGAGCCCC ACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGA?VTTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGGAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGT CCACCAGGGAGGGAGACACACAAGCCTT T T CT GCAGGCAGGAGT T TCAGACC CT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 



MRT LLT I LT VGS LAAHAPEDPS DL LQHVKFQS SNFENI LTWDS GPEGT PDTVYS I E YKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
S PS DQGPS PWGLLESLVCPKDEAKS PAPETS DLEQPTELDSLFRGLALT VQWE S 



fp Signal sequence* 

jjj amino acids 1-17 

hi 

\-, ■ 

Transmembrane domain. 



amino acids 233-250 

» \ 

tj'| N-glycosylation sites ♦ 

W amino acids 80-83, 87-90, 172-175 



N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



'i'GG CCTAC T GGAAAAAAAAAAAAAAAAAAAAAAAG T C AC CC GG GC C CGCG GT GG CCACAAC ATGG 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATAT GT CT ACTACAAAC TGGCAGGGGGAT CCC T TGAACTTT GGGC TGGA 
AGT GT T GAACACAG T TT T GGAT AT T T TCCA&AAGATT TGAT CAAGG TACT TCATAAATACACGGA 
AGAAGAGCT ACATAT TCCAGCAGATGAGACAGACT T T GT CT GC TT TGAAGGAGGAAGAGATGAT T 
T TAATAGT TATAATGTAGAAGAGC T T TTAGGATC T T T GGAAC T GGAGGAC TC T GTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGT T T CT CAGCACAGAGAGAAATCTCCT GAGGAGTCT CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 



m ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
^ S AGACAGATGCT TACAAAGTCC T GAAAACAGAAAT GAGTCAGAGAGGAAGTGGACAGTGCGT TAT T 



CAT TACAGCAAAGGATT T CGT T GGCATCAAAAFCTAAGT T T G T TT TACAAAGAT TGT T T T TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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MAAAPGLLFWLF^GALWWPGQSDLSHGRRFSDLKVCGDEECSMMYRGKALEDFTGPDCRFW 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSES TEGLQGQPSAQESHPHTSGPAANAQGVQSSLDT FEE I LHDKLKVPGSESRTGNSS PAS VER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCT CAGCCTCTCAC TTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCC T GC CC AGATC TCTACCAGTGGT CT GATGGAAGCAAT T CC CAGTACC GAAAC T GGTACACAG 
H ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
Q CT T GGGGGT CCCTACC T T TACCAGTGGAATGATGACAGGT GTAACATGAAGCACAAT TAT AT TTG 

f% CAAG TAT GAACCAGAGAT TAAT CCAACAGCCCC TGTAGAAAAGCC T TATC T TAC AAAT CAACCAG 

%t GAGAC AC CCATCAGAATG TGG T TGT TAC TGAAGCAGGT ATAAT TCCCAATC TAAT T TATGTT GT T 

f J ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
W GCAT AAAAGTAAAGGAAGAACAAAAACTAGTCCAAAC CAGT CT ACAC TGT GGAT TT CAAAGAGTA 

U1 CCAGAAAAGAAAGTGGCATGGAAGTATAATAAC TCAT TGACTT GGTTCCAGAAT T T TGTAAT TCT 

%l GGATC TGTATAAGGAAT GGCATCAGAACAATAGC TT GGAAT GGCT TGAAAT CACAAAGGATC TGC 

if l AAGATGAACTGTAAGC TCCCCCTTGAGGCAAATATTAAAGTAATTTT TATATGT CTAT TATT TCA 

* M T T T AAAGAATATGC T GT GC TAATAAT GGAGTGAGACATGCT TATT T TGCTAAAGGAT GCACCCAA 

* ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
P TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
jjj T GT AT AT TGTAT TGAAAT T TACAGTGTGCAAAAGTAT TT TACCT TTGCATAAGT G T T T GATAAAA 

m ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
f ?: ! AAAC T TAT T ACTGT T GT CAACT GAAT TCACACACACACAAATATAG TACCATAGAAAAAGT T TGT 

yf TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
CI AATAAGAAGCTATT TCATTAAGTGTGATATAAACCTCCTCAAACATT TTACTTAGAGGCAAGGAT 

f|j TGT CTAAT TTCAATTGTGCAAGACATGTGCCTTATAATTAT TT TTAGCTTAAAAT TAAACAGAT T 

T TGTAAT AATGTAACTT TGTTAATAGGTGCATAAACACTAATGCAG T CAAT TTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGT CT TCACACGT T GC CTAT ATAATGAGAAGCAGCTCT C TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATT TT AGGAAGGAAAGGAAC TACGAAATCGTGTGAAAAT GGG T TGG 
AAC CCAT CAGT GAT CGCATAT T CAT T GATGAGGGT T T GC T T GAGATAGAAAATGGT GGCTCCT T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNY'ICKYEPEINPTAPVEK 
P YLTNQPGDTHQNWVTEAG 1 1 PNL I YWI P T I PLLLL I LVAFGTC C FQMLHKS KGRTKT S PNQ 
STLWISKSTRKESGMEV 



Important features of the protein 
Signal peptide: 

amino acids 1-21 



Transmembrane domain 

hi amino acids 214-235 



N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



n amino acids 266-2 69 



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114,: 140 
145, 212-217 : : ? 



f 
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